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Introduction to the Newsletter 



Welcome to the Newsletter for Asian aiid Middle Eastern Languages on Computer 
This is the first of many issues of the only publication dealing with Asian and 
Middle Eastern languages on computer. I started this newsletter to help bring 
together all people who are working with any of the Asian or Middle Eastern 
languages on computer because there is so little available on this topic from 
any other journal or organization. This newsletter is published with some help 
from the Center for South and Southeast Asian Studies at the University of 
California at Berkeley^ although there is no formal connection between the 
Center and this newsletter. Sole responsibility for the contents of the 
newsletter rests with the editor. 

We hope to cover a large geographical area, including all of Asia and the Middle 
East. There are certainly enormous differences between the writing systems of 
Japan, India and Saudi Arabia. Although these languages use writing systems 
that di-ffer from each other, each system is certainly different from English. 
Much can be learned from the techniques used to put, for example, Japanese on an 
Apple II, even for someone working with Sanskrit. We are still in the early 
stages of making this happen and it is still an innovative programmer who can 
put Chinese or Korean or Tamil on an IBM PC. In the short history of computers, 
almost all attention has been placed on working with English and other European 
languages. Only recently has it been both possible and affordable to use 
non-western writing systems cn computers. None the less, more and more Asian and 
Middle Eastern scripts are making it into computers, and this newsletter is here 
to help anyone interested to keep up with these exciting events. 

All the computers mentioned so far are microcomputers. The reason for this is 
that we are now in an age when computing power and access are available in a box 
that will comfortably fit onto a desk top for a small ainount of money. The 
newer microcomputers in particular have enough graphics capabilities that 
non-Western writing systems can be easily displayed. This is not to say that 
minicomputers or mainframe computers can't be used, it's just that more work in 
this area seems to be occurring on micros. Nevertheless, we will cover the 
field no matter what the computer involved. 

The potential audience for this newsletter is unusually broad, both in knowledge 
of computers and in possible applications of the programs. Some of the 
subscribers include scholars of the various languages, librarians, historians, 
linguists, business people and diplomats. Some are professional computer 
scientists and others have only recently begun using computers. This makes for 
a wide variety of articles, as can be seen in the articles in this first issue. 

This newsletter can only exist on the contributions of its readers It seems 
that in the last couple of years, many people have ventured into using computers 
for word processing and other such tasks. It is still no easy task to use Asian 
and Kiddle Eastern languages on computer, so if you've found a better way, or 
you have suggestions for programmers on how it should be approached, please 
submit an article on it. 

One of the topics that we'll be dealing with in future issues is computer 
networks. This will become a more and more important method of communications, 
for individuals as well as organizations. I am most interested in receiving 
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manuscripts electronically. If you are interested in submitting your article 
through CompuServe or through some other network, please contact the editor. 

This newsletter is being produced for a minimal profit. My time spent editing 
the newsletter is not compensated for, at least financially. The subscription 
money wall be used to pay for paper, printing, envelopes, mailing labels and 
postage. I don't expect to make any profit, but I also don't want to lose money 
on this venture. I have reluctantly increased the subscription fee to $10 per 
year. This is for several reasons. The size of the newsletter is about twice 
as large as I had originally expected. Postage rates are scheduled to go up 
shortly after this issue will be mailed. The newsletter will be printed rather 
than photocopied. If you have already paid for your subscription, then you got 
a bargain! Everyone else must pay the new rate of $10 per year. Note that this 
rate is for the entire world. The American standard of living is higher than 
the rest of the world, with perhaps a few countries being exceptions. The $10 
that an American pays generates a small profit. This profit will be used to 
subsidize the cost of airmail postage to the rest of the world. 

The majority of this newsletter was written and edited on an Apple Lisa 2 
computer using the Lisa 7/7 Office System. It was printed on an Apple DMP II 
printer. 

A large number of people have helped in various ways to get this newsletter off 
the ground. I would like to thank the following people, hoping that I haven't 
left anyone off: first from the University of California at Berkeley Bruce Pray 
and Barbara Guerlin of the Center for South and Southeast Asian Studies Ken 
Logan of the South and Southeast Asian Library, Sally Sutherland and George Hart 
of the Department of South and Southeast Asian Library, James Agenbroad of the 
US Library of Congress and Jawes Nye, now at the University of Chicago Lastly 
this newsletter would never have gotten out if it were not for the assistance of 
my wife, Diana. 

It has taken longer than I ever thought Lo produce the first issue, but now that 
It is out, future issues will take less time to produce Thank you all for 
being so patient. 



In this Issue 

There are three articles in this issue. The first, on using Wordstar to produce 
diacritics, is by Tony Stewart. This article is a revision of an early draft 
that he was circulating in Chicago. Tony is a doctoral student in the 
Department of South Asian Languages and Civilizations at the University of 
Chicago. He is interested in the history of South Asian Religion Hp can be 
reached at: 713 W. Barry Avenue. Apt 2-N, Chicago, IL 60657. 

The second article, entitled "The Universal Typewriter", is by Professor David 
Wyatt and one of his sons. This article gives a mildly non-technical overview 
of some software that the two developed for working with Southeast Asian 
languages and English. While the software now only runs on a particular brand 
of CP/M-80 computer, it is being moved to the IBM PC. David Wyatt is Professor 
of History at Cornell University. His interests are in the history of Southeast 
Asia, especially Thailand and Laos. He can be reached at; Department of 
History, Cornell University, HcGraw Hall, Ithaca, NY 14853 




Hensletter for Asian and Middle Eastern Lanfluages on conputer. Page 2 

5 



Th3 last article, on multi-lingual word-processors, was written by Lloyd 
Anderson. This article is a slightly modified version of a paper he has been 
circulating among linguists. Lloyd is a typological linguist and has recently 
become involved with the national standards committee for character sets. He 
can be reached at: Ecological Linguistics, 316 A Street, S.E., Washington, D.C. 
20003. 

Regular features are a calendar of events and a section for announcements and 
queries. 

The reviews section in this issue covers organizations, journals, books and 
articles, but no hardware or software. The next issue will have a large set of 
listings of software and hardware. All reviews were written by the editor. 



Anthony Meadow. 
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DIACRITICS ON WORDSTAR* 
SOWm ASIAN LANGOAGE TRANSLITERATION WITHOOT CUSTOMIZED SOFTWARE 

Tony K. Stewart 
University of Chicago 

Diacritics for transliterating the many South Asian languages can 
be produced on your microcomputer with off-the-shelf word processing 
software. Of the many packages, Wordstar® handles the diacritics "prob- 
lem" with perhaps the greatest facility, although not necessarily the 
greatest ease, of any of the more readily available word processing 
programs. Among the products which were personally tested, Benchmark®, 
Word III®, The Final Word®, and WPS-80" word processing programs proved 
themselves capable of producing some manner of diacritics, but with 
rather severe limitations in the production process or final copy. The 
most common drawback was the placement of the diacritic mark itself: 
the diacritic mark tended to fall in full half-line increments above or 
below the character in question.^ This resulting copy proved less 
desirable in most cases than manually typed copy using an IBM Selectric"' 
typewriter with European ball, or some equivalent. Wordstar, however, 
possesses what appears to be a unique feature among off-the-shelf word 
processing programs: the ability to raise or lower the superscript and 
subscript to the desired height. Of course your printer must be capable 
of such fine adjustments (more on that later), but the program itself 
will handle virtually any requirement for diacritics in the translitera- 
tion of Sanskritic or Dravidian languages. The same process will apply 
for creating diacritics to represent other Asian languages, the only 
limitation being the, symbols required on the printwheel or fashioned on 
a dot matrix. To describe the procedure for using Wordstar in the 
production of diacritics, we shall consider four broad topics: 1. How 
to create the diacritics; 2. printer requirements; 3. additional fea- 
tures to facilitate editing and printing with diacritics; 4. user 
definable function keys. 



^It should be noted that I did not have the opportunity to test 
WordPerfect® by Satellite Software International, which superscripts and 
subscripts one third of a line, a distance that should allow for accept-- 
able placement of the diacritics. 

* * * 

Wordstar® is a registered trademark of MicroPro® International; Bench- 
mark® is a registered trademark of Metasoft Corporation; Word III® is a 
registered trademark of Samna; The Pinal Word® is a registered trade- 
mark of Mark of the Unicorn; WPS-eO"" is a registered trademark of Excep- 
tional Business Solutions; Selectric"' is a registered trademark of IBM. 
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I. HOW TO CREATE TRANSLITERATION DIACRITICS ON WORDSTAR 

A. COMMANDS: 

All diacritics can be created using only three commands: super- 
script, subscript, and overstrike. All three commands are located in 
the "print" menu of Wordstar: 

!• '^P'^T = superscript 

2. '^P'^V = subscript 

3* "P"H = overstrike (or backspace) 

The commands are used either singly or in combination, depending on the 
diacritic mark desired. In the case of the superscript ("P"T) and the 
subscript ("P"V), you must always remember to turn the command off after 
the mark has been typed, for these two commands are "toggles." A toggle 
is either "on" or "off" and will remain in that state until reversed. 
The overstrike command ("P"H) is not a toggle and will only affect the 
subsequent character. 

B. SEQUENCE: 

In standard transliteration schemas for most Indian languages, the 
macron, denoting a long vowel requires the superscript (''P'^T) and over- 
strike ("P"H). The dot or period below a character, generally denoting 
a retroflex consonant or 'vocalic r' requires the subscript ("P"V) and 
overstrike ("P"H) in combination. The tilde for nasals requires only 
the overstrike ("P"H) because it is already located in the correct 
position to print above a character and needs only to be backspaced. 
The accent, for the sibilant Is the same as the tilde in that it only 
requires the use of an over5Jtrike or backspace command ("P"H). 

Remembering that the superscript and subscript commands must be 
turned on prior to the diacritic mark and turned off immediately there- 
after^ the sequence of actual keystrokes for producing examples of the 
four marks looks like this: 



1. 


macron = 


"P"T-"P"T"P"Ha producing 


/a/ 


2. 


dot or period = 


"P"V.'P"V"Hd producing 


/d/ 


3. 


tilda 


n'^P'^H" producing 


/n/ 


4. 


accent = 


s'^P'^H' producing 


/s7 



Newsletter for Asian and Kiddle Eastern Languages on Conputer, 1(1) f^age 5 

ER?C 



Diacritics on •ordstar 



Stewart 



Note that those operations requiring superscript or subscript follow a 
sequence which types the diacritic mark first, then the character it- 
self, while the tilde and accent reverse that procedure.* 

C. ONSCREEN: 

You should note at this point, however, that what you see on your 
screen will be different from the above in several respects. First, 
when you invoke the print menu, the '^P command does not stay on the 
screen, but the following command (either "T,"V, or "H) will show on the 
screen. Second, all of the many versions of Wordstar tested on differ- 
ent computers displayed the diacritic in a separate column beside the 
character, rather than on top of or below it. Consequently, you should 
expect a foreign word such as Radha to appear on the screen as 

R"T-"T"Hadh"T-"T"Ha 

before removing the commands. 

The print commands can be removed from the screen (but remain in 
place) by using the "Onscreen Print Display" command ("0"D). After sup- 
pressing the print commands with the "0"D, your screen should look like 
this: R-adh-a. A liberal use of the '^G'^D command will facilitate your 
editing and proofreading as you type. Being a toggle, the "0"D command 
can be used as often as needed to check the spelling of the word, for in 
practice, amidst the clutter of print commands, the actual letters and 
diacritics of the word can get confused or omitted. 



D. SUPERSCRIPT/SUBSCRIPT ROLL (.SR N) : 

It is the ability to raise or lower the super/subscript in minute 
increments which sets Wordstar apart from other word-processing packages 
and makes it well suited for producing diacritics. To raise or lower 
the super/subscript use the dot command (.SR n). Wordstar can adjust 
this roll in 1/48" increments. The default value for the Super/sub- 
script Roll is 3/48" which is good for most applications (.SR 3). If 



^Dividing the print commands into two separate steps — associating 
only one command with each printed part of the new character— eases the 
load on the printer. The diacritic mark itself is either superscripted 
or subscripted, with the accompanying character merely being backspaced 
and overstruck, as opposed to printing the character in a normal posi- 
tion and attempting simultaneously to backspace and super/subscript the 
diacritic. Few printers can tolerate the convolutions of the latter. 
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you are typing in all capitals, however, it may be useful to chc-nge to 
4/48" (♦SR 4) which is a full half-line. If you want to bring the marks 
closer to the character (which will work with only selected print 
wheels) 2/48" (.SR 2) is the limit. 

The super/subscript roll (.SR n) is adjustable only on a per line 
basis, which sinnply means you cannot change it mid-line. Since most 
changes would bo made for capitalized titles, etc., this generally pre- 
sents no problem. Examples of the super/su'- >cr ip;: roll dot comand show 
the flexibility: 



( .SR 


4) 


RADHA; 


KRSNA 






Ridhi ; 


Krsna 

... 


( .SR 


3) 


RADHA; 


KRSNA 






Radha ; 


Krsna 


( .SR 


2) 


RSDHK; 


KRSNA 






Radha ; 


Krsna 

... 



Purists will dislike the compromise, however, between the position of 
the macron and the position of the subscripted dot, since the proper 
placement of the former pushes the latter a bit too low under the 
character, while vice-versa crowds the macron onto the top of the char- 
acter. This inability to switch super/subscript heights mid-line is the 
greatest weakness in the process. 

E. UNDERLINING AND BOLDFACING: 

Unlike a typewriter, which places the underscore directly below the 
word on the printed page, computers generally do not underline with a 
key-stroke, but with print commands. In Wordstar these commands have to 
be watched very carefully because the underscore toggle (^P^S) has to be 
switched off for the actual superscript or subscript, otherwise you will 
get a line under both of those as well. If you simply insert the 
underscore print command (^P^'S) before the word and turn it off after 
the word, the result looks lii.e this: 

Radha ; Krsna 

So, you must remember to turn underline command off before each super/ 
subscripted diacritic and turn it on again before the following letter. 
The key strokes become quite involved and often lead to mistakes. For 
instance, the word Radha when properly underscored will involve the 
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following strokes (the boldface indicates the actual characters to be 
printed) : 

"P"SR"P"S"P"T-"P"T"P"H"P"Sadh"P"S"P"T-"P"T"P"H"P"Sa"P"S; 
while the screen would look .like this: 

"SR"S"T-"T"H"Sadh"S"T-"T"H*Sa"S; 
and it would print like this: 
Radha . 

Clearly, the effort involved is tedious and time consuming, but there is 
an alternative, which is just as effective and eliminates most of the 
hastle: boldfacing. Boldfacing effectively sets apart the word so that 
it is easily recognized as foreign and can be implemented by simply 
turning on the boldface print command ("p"B) at the beginning of the 
word or string of words and turning it of.f again at the desired end. 
Examples in boldface, rather than underline, look like this: 

Radha; Krsna; jnana; KasT. 



F. CAUTIONS: 

There are three- weaknesses in the Wordstar prograoi which can lead 
to problems if not watched carefully. They are not insurmountable, 
rather simply restrictive. 

1. When reforming paragraphs ("e), the macron (superscripted dash) 
can be read by the machine as a hyphen. This results in the breaking of 
a word at the macron with the following vowel appearing on the next 
line. This occurs because the superscript is only a print command, and 
does not differentiate the macron from a hyphen for any of the word- 
processing functions. This potential problem can be avoided by manually 
hyphenating problem words. Manual hyphenation requires close attention 
on the part of the typist, which is to say that the automatic repeating 
paragraph reform ('^Q'^Q'^B) should probably be avoided, it is much easier 
to double-check one paragraph at a time as you move through a document, 
rather than re]:orming the entire document and going back to check word 
breaks. 



2. Another major caution concerns the use of microjustification. 
Due to limitations on the power of Wordstar, the diacritic mark can be 
unevenly placed around the letter when microjustification is on. On 
words with a single diacritic this problem generally does not appear. 
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Similarly, on words with the diacritics sufficiently spread throughout, 
the problem frequently does not show. But on words with diacritics in 
close succession, the microjustification will on occasion jam up the 
sequence. For example, words with a succession of diacritics can look 
like this with microjustification turned on: 

Krsna or Candidasa 
... . • 

and like this when turned off: 

Krsna or Candidasa. 
. . • • . 

Your particular system may respond somewhat differently due to printer 
variation. However, the whole problem can be avoided by switching off 
microjustification either using a dot command (off - .UvT fo; on = .UJ 1), 
or by switching the microjustication off when you install the program. 
Note that microjustification ?s not the same as justifying the margins. 

If you do want justified right margins with microjustification 
turned off, the program will simply divide the number of 
spaces as evenly as possible between the words on the line 
(like this example), rather than by microdivisions, which 
appear much more even to the eye. Although this may not 
appear as even to the eye, it obviates the problem of dia- 
critic placement and still allows for justification. For 
example, the words Radha and Vaisnava are prinned with proper 
placement of diacaritics. 

The remaining option is, of course, simply not to justify the right mar- 
gin (i.e. "ragged right margin"), keeping the microjustication perma- 
nently turned off, like this example. 

As a final option to circumvent the problems mentioned above, you 
can trick Wordstar into doing the diacritics while justifying the mar- 
gins and using microjustification (not to be confused with ttue propor- 
tional spacing/ which Wordstar does not cio). The latest versions of 
Wordstar (3.0-3.3) seem to have improved the quality of the microjusti- 
fication, even though you can still have an occasional ourprise. Each 
machine, however, will respond differently to Wordstar so you will have 
to experiment. I have found that by using a dot command for character 
width (.CW 11), I can right justify with microjustification turned on 
and still get the diacritics to come out properly underneath and overtop 
of the characters. Thi<^ dot command pushes the characters together just 
a little, so that using a ten pitch wheel (i.e. Pica, normally .CW 12) 
you can get text to look almost like true proportional spacing. For a 
twelve pitch wheel (i.e* Elite, normally .CW 10), you can use .CW 9 for 
the same effect, but I do not care for that as much because it makes ^ne 
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lines too long and therefore harder to read. The body of this text has 
been printed using a ten pitch Courier wheel with the character width 
set at .CW 11. 

When you right justify and leave on the microjustification, the 
diacritics in words such as Krsna or Vaisnavas, and in less taxing words 
like Radha or brahmana turn out very nicely, it should be noted that by 
simply switching to the narrower pitch (.CW 11 for pica), the right mar- 
gins are. not as ragged when you do not right justify, nor does the eye 
pick up the discrepancy as quickly when you turn off the microjustifica- 
tion (.UJ 0) and continue to right justify. In short, you will have to 
experiment with your own computer/printer combination to see what mar- 
riage of pitches, justification settings, etc. proves the most fruit- 
ful. 

3. Single spacing: a possible problem. You may find that when 
you attempt to install your diacritics in single-spaced text, they may 
tend to bleed or run together vertically. The default values for Word- 
star are set to assume six lines per inch (=66 lines/page), which is a 
line height of 8/48" (.LH 8). If you are experiencing this difficulty, 
you should adjust your line height to .LH 9 ( = 5.3 lines/inch) or .LH 10 
(=4.8 lines/inch). When you change the line height, remember that you 
will have to change the page length (.PL n) to correspond. With .LH 9 
or .LH 10 you may want to decrease the number of lines in the top margin 
(.MT n) and bottom margin (.MB n). Consult your Wordstar manual for the 
reasons and effects of this alterations, but as usual there is no sub- 
stitute for experimentation. This text has been composed using a line 
height of .LH 9 and a page length of .PL 58. 



!!• PRINTER REQUIREMENTS 
A. GENERAL: 

Without an appropriate printer, the above commands are useless. In 
order to take full advantage of the power of Wordstar in creating dia- 
critics, you must be extremely selective in choosing a printer. There 
are several minimal requirements: 

1. The printer, whether daisy wheel or dot matrix, must be capable 
of printing the superscript/subscript and the overstrike or backspace. 
Good quality ink-jet and all laser printers are sufficiently sophisti- 
cated to handle any Wordstar command. 

2. The printer must be capable of small line increments, prefer- 
ably of 1/48". The fine increment is needed to accomodate the "super- 
script/subscript roll" (.SR n) explained above. Lower quality printers 
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will print the superscript/subscript only at a fixed height, most often 
4/48" or one half line. If this is the case, then the diacritics will 
be too high or low, which not only defeats the advantages of using Word- 
star, but completely eliminates the possibility for single spacing* If 
the printer is so limited, then many of the other word-processing pro- 
grams mentioned in the opening paragraph that handle superscript/sub- 
script would be equally acceptable* 

PRINTWHEELS: 

1* The printwheel for daisywheel printers must be carefully 
chosen* All printwheels will have the dash/hyphen and the period to 
create the macron and subscripted dot but not every printwheel will 
contain the tilde or acute accent. While the tilde does appear on many 
standard 96 character American printwheels, the acute accent will gener- 
ally require a European printwheel, most of which possess a slanted 
apostrophe or acute accent in place of the vertical American apostrophe. 
(Ironically, almost all 96 character American printwheels have the grave 
accent [eg. /a/], which is almost never used in English, while the acute 
accent is often used in words of French derivation.) European print- 
wheels can be obtained from your dealer. - 

2. Please note that some printwheels have both accents and the 
vertical apostrophe. Whether this is preferable is strictly a matter of 
taste, but on such printwheels the acute accent will be placed in a 
different position on the keyboard (using for example the Comm. System). 
Unless your particular computer has the various European alphabets built 
into ROM and you have prograu.med the change, your screen will not show 
the accent, but rather, will show whatever key is in the same place on 
the standard American keyboard. This can be disconcerting, but more 
importantly, it can lead to confusion and a lot of work if you change 
printwheels later. 

3. Although not recommended, printwheels can be customized to 
specification. This is fairly costly. With the limited life of most 
printwheels, especially plastic (and most companies will only customize 
plastic), an untimely break can be devastating. If one chooses to go 
this route, it would probably be more effective to have the desired 
diacritic made up special and not fuss with the superscript/ subscript, 
etc. But one should be prepared to invest considerable money and time 
for testing before going ahead with this commitment. 

4. If you cannot find a type font which is suitable among the 
European printwheels, you can use a standard American printwheel (with 
the vertical apostrophe in place of the slanted apostrophe or accute 
accent) and simply insert the "print pause" command ("^P^C) to replace 
the wheel for the single accent. The "print pause" is a toggle, so 
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remember to insert it directly before and directly after the accent. 
Clearly this method is desirable only on very short or selected docu- 
ments because changing the wheel is very time consuming. 



III. "FIND AND REPLACE" TO INSERT DIACRITICS 

A. rationale: 

Experience has shown that unless you are working with a very short 
document (i.e. two pages or shorter), it is easier to insert the dia- 
critics after the document has been completed. There are a number of 
reasons for this, but simple speed and accuracy are the overriding con- 
siderations. It is much faster to type and edit without the diacritics 
slowing you down and cluttering up the screen. Editing also opens the 
possibility of accidentally removing some of the print commands, espe- 
cially since you would be most likely to edit with the print commands 
off the screen ('"o'D). The solution is easy and efficient, far more 
than the old method of inserting the diacritics by hand: simply use the 
"find and replace" to insert the diacritics. 

B. "FIND AND REPLACE" COMMANDS: 

The command for "find and replace" is "Q^A. Wordstar will provide 
prompts for the entire process, starting with the question "Find?" 
After typing the word or phrase, the program will respond with "Replace 
With?" At this point you type the word with complete diacritics and any 
other print commands (eg. boldface). After entering the new correct 
spelling with diacritics, you will have to choose the method of search- 
ing and replacement, with the prompt, "Options (? for Info)" The op- 
tions (B,W,U,N,G) are important and should be completely understood. 

1. B = Search Backwards - this does exactly what it says. The 
program will only search in one direction, so if you are in the middle 
of a document remember that it will only search in one direction from 
that point. The easiest method to avoid missing any words is to pass 
through the document starting at the beginning for one word and search 
backwards from the end of the document for the next word. 

2. W = Whole Words Only ~ This option will only replace the word 
or phrase if it stands alone. Consequently, words that may appear in 
compounds (eg. Krsna in Krsnadasa) will not be changed if one resorts to 
this command, ff you ignore this option, all appearances of that se- 
quence of letters will be replaced. 

3. U = Ignore Case - This option will select any sequence of 
characters that matches the "Find?" entry regardless of case, but will 
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replace it exactly as typed in the "Replace With?" entry. So if you 
have capitalized a word, you should note it and change it back . inuaily 
or you should ignore this option and make a separate pass with the 
capitalized spelling. The latter is more efficient since it tends to 
eliminate the human error* 

4. N = Replace Without Asking - By using this option the program 
will make the change without prompting; otherwise you will have to in- 
struct the program from the Y/N (Yes/No) prompt, which will appear at 
the top of the screen* 

5. G = Replace in Entire File - If this option is not used, the 
program will search for the first appearance of the word in question and 
stop. This command can speed up the process considerably, especially in 
longer files, and it avoids the need to retype the ^Q^A command to ini- 
tiate the whole process again. 

HINTS AND CAUTIONS: 

1. Because of the system of transliteration and the inability of 
the machine to distinguish on the level of semantics, some words wijul 
just have to be changed manually (for example, the pair rasa/rasa or 
bhava/ bhava)» In order not to miss any occurence, use the "Find and 
Replace" CQ^A) command and make individual decisions as they occur^ 
answering Y/N for each case (clearly you would not use the 'N' or 
"Replace Without Asking" option, but would use the 'G' or "Replace in 
Entire File" option). 

2. Those plurals which are formed by adding 's' can be handled by 
ignoring the 'W' option (Whole Words Only) in "Find and Replace" (^Q^A). 
Otherwise, you will have to make separate passes through the document 
for the singular and plural forms. 

3. If you misspell a word initially, the machine will not pick it 
'^p when it searches to replace diacritic marked words. If you are not 
an accurate typist, then the advantages of "Find and Replace" are limi- 
ted. A dictionary or spelling checker program, especially one which 
allows you to add your own words (like Spellstar*), would be useful* 

4. If you use shorthand while composing your text, the "Find and 
Replace" command will easily change your shorthand into the final full 
form of the word or phrase (eg. typing 'V.' for Vaisnava). Two cautions 
are in order. First, always be consistent in your 'abbreviations, or it 
will have the same effect as misspelling a word. Second, documents with 
manual footnoting may require extensive reformatting in order to accomo- 
date the lengthened text created when the abbreviations are filled out. 
In longer documents with footnotes, I recommend taking the time to type 
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out the full word. However, inaccurate typists may find this an easy 
way to avoid typos and subsequently avoid mistakes while inserting 
diacritics. A footnote program will substantially reduce the problems 
created by the use of abbreviations or shorthand. Unfortunately, 
Wordstar does not have macros which would allow you to insert frequently 
used words or strings of words at a single keystroke; abbreviating and 
using the "Find and Replace" command is similar, but clumsier. 

5. Never forget how literal the machine is. There is no substi- 
tute for accurate proofreading, regardless of the many advant=^ges the 
word-processing software affords the user. 



IV, USER DEFINABLE FUNCTION KEYS 

On many of the newfer sixteen bit micros, such as the popular per- 
sonal computers from IBM, DEC, AT&T, Compac, etc. you can reprogram some 
of the user definable function keys to simplify the insertion of dia- 
critics. The changes can be made when installing Wordstar the first 
time or in a subsequent reinstallation. Because the function keys have 
already been assigned values, you should consider very carefully which 
keys are to be used. There are some inherent limitations due to the 
different hardware architectures and how effectively Wordstar has been 
adapted to the machine in question. For example, on the DEC Rainbow" 
the allowable string of characters is limited to four for any given 
function key. Consequently, to program for the superscript or for the 
subscript, two keys are needed (eg. superscript macron: key 1 = ^P^T-; 
key 2 = "P"T"P"H), which eliminates five keystrokes. For more details 
on how to reprogram function keys in the advanced versions of Wordstar, 
consult the installation section of the manual. 



POSTSCRIPT 

Although Wordstar is an old program, MicroPro has successfully 
adapted it to any number of different micro-computers while improving 
its overall performance. As mentioned above, the program has limita- 
tions which are occasionally an annoyance, but overall, the technology 
allows the user to manipulate the text to suit his needs, rather than 
forcing the user to change his needs to fit the technology. Hopefully, 
in the future, the programs will allow even greater flexibility (eg. 
true proportional spacing using diacritics, etc.), providing us with the 
capacity to generate more professional documents without the fuss while 
substantially cutting the cost of printing. 

* * * 
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Imagine a typewriter on which it might be possible to 
type any alphabet. Or, even better, a polylingual and 
mul tialphabeti c word-processor. As one who works with a 
variety of Southeast Asian alphabets I have long dreamed o-f 
such, but until recently could get no -further than to 
procure Thai typing elements -for my antiquated IBM 
Selectric. Like many, I droolsd over the Xero^: multilingual 
word-processor described in the July 1?84 issue o-f 
Sci ent 1 -f i c Amen can , but struck it from my Christmas list on 
realizing the cost o-f that machine. 

However, i-f one is willing to put up with some 
relatively minor inconveniences, a very serviceable 
universal typewriter and word-processor already is in 
e;:istence, as will become apparent -from some o-f the sample 
printouts reproduced below. With the aid o-f master 
programmer Douglas Wyatt, we now have running a CP/M 



1. Department or History, Cornell University, h'cGraw Hall, 
Ithaca, NY 14353. 

2. New York Dairy Herd Improvement Cooperative, Ithaca, NV 



14850. 
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microcomputer capable o-f both displaying and printing any 
alphabetic script that runs -from left to right- whether or 
not it has complex superscripts and subscripts, and whether 
or not that script has ever even been printed before. 

□ur hardware is one oi the least well-known, "plain 
vanilla" microcomputers on the market (and also one of the 
cheapest -for the features it has): the Lobo MAX-80. C3j 
Quite apart from its ability to run thousands of existing 
software programs, and its capacity to accommodate up to 
eight floppy and/or hard disk drives, it is one of the rare 
machines around that dees not store its displayable 
character set — the alphabets and symbols one sees on the 
video display — in permanent memory (ROM) , but instead 
loads its character set into user — accessible memory (RAM) 
each time one starts up the machine. C4j It is therefore 
possible to change that character set in any way one wishes, 
including to no.T-Roman alphabets. 

Furtliermore, like most microcomputers the Lobo MAX-80 



3. Lobo Systems, Inc., 318 East Gutierrez, P.O. Box 4626, 
Santa BarbBrs^, California 93101; (805) 564-3356. The basic 
unit is priced at ^^945, plus monitor (^^139) and disk drives 
(2 5" drives for ^^485; 2 SSOD 8" drives for ^^1185; 2 DSDD 8" 
drives for ^:1485) . The basic unit has 128K' RAM, runni ng 
under CP/M 3.0; and it will also run the TRS-80-compatibl e 
LDOS. 

4. There are several other microcomputers that, to my 
knowledge, handle their character sets similarly, including 
the Tandy 2000. Are there others? 
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regularly holds two separate character sets in memory — 
normally a "bright" set and a "dim" set, or a regular set 
and one in inverse video — the first green-on-black and the 
second black-on-green. Lobo allows the user to toggle 
between these two sets using any special function key s/he 
may dessignate, just as Lobo also allows one to redefine any 
key on the keyboard (for e;jample, to create a Dvorak 
keyboard in place of the usual QWERTY one). 

Just within the common realm of roman alphabets, the 
four users of this machine each employs his own, 
custom-designed character set; and James Wyatt sometimes 
will run the machine displaying Olde English or even 
Tolfien-ish runes, while ten minutes later Andrew Wyatt will 
be displaying a sans-serif font. Because I sometimes try to 
create bibliographical files ti at emulate OCLC records, I 
have redefined the backslash ("\") as a double-dagger on my 
display. Even this degree of flejiibility is both unejjpected 
and welcome in the microcomputer world. 

But after a good deal of ejip-eri mentation we have gone 
several steps further, to the point where I can now intermix 
roman with Thai or Lao scripts on the display screen. 
Recently, when transcribing an old Lao manuscript on the 
computer I encountered subscripts for which I had not 
provided when creating the Lao character set. I was able to 
add them to the display screen and to the printer in less 
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than an hour. 

Let me descr i be brie-fly how thi s system v^orks , by 
taking the reader through the sequence o-f actions I have to 
take to turn an " ordi nary " mi crocomputer into a "uni versa 1 
typewri ter . " 

The -first requisite is to de-fine a character set -for 
display. The Lobo MAX-80's character sets are composed o-f 
dots (pijiels) in an 8 8 grid. A public-domain BASIC 
program -for de-fining characters -for Lhe MAX-80 is available 
■from the Lobo users' group, MAXIMUL; but this is relatively 
slow, crude, and in-flexible. Douglas Wyatt instead has 
created a character -font editor, EDFONT, written in the C 
programming language (and there-fore translatable -for other 
machines), that allows -for the creation and editing o-f 
character setss to any speci -f i cat i ons , very conveni ent 1 y and 
easi 1 y . C5J 

The MAX-eO's display character set is created and 
stored in groups o-f eight numerals -for each letter in the 
set ; each numeral de-f i ni ng the "lit" and "unl i t " pi .vel s in a 
si ngl e hor i nontal row. Thus the deci mal numeral 131 
(hej:adeci mal 83) when converted to binary -form would 
represent a row o-f pixels in which the -first, seventh, and 



5. Commercially-available so-ftware -from PowerSOFT, running 
under LDOS, also accomplishes these same character-set 
editing -functi'ns. We have not had occasion to test it. 
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eighth pij:eis were lit. Unclear? Decimal 131 in binary is 

lOOOOOil — representing ^-^ in pi:jel«. ! This will 

become even clearGr if we take the representation o-f a 
complete letter o-f the alphabet — let us say the -first 
letter o-f the Thai alphabet: 

Bi nary Deci mal 





OOOOC'OC'O 


0 




0111 1100 


124 


* 


10000010 


130 




0100001 0 


66 


. * . 


0100001 0 


66 


. ^. . . . ^. 


01000010 


66 




01000010 


66 




01000010 


66 



Thus the Thai "k" character will be represented 
as 0,124,130,66,66,66,66,66 -for display purposes. 

To print the same character, printers ordinarily expect the 
pi)?els o-f a character to be de-fined in terms of their 
vert 1 cal , rather than hor i non tal , al i gnment . For printing 
purposes, there-fore, the Thai character above (reading the 
binary numerals vertically rather than horizontally) would 
be 32,95,64,64,64,64,63,0. Douglas Wyatt also has written 
utilities to convert hori::ontal display characters into 
vertical printing characters. For a sample, Figure 1 below 
repAiPduces this paragraph pr i nted in James Wy^t t ' s Runes 
-font, created on an 8 x 8 grid -for display and converted to 
the Epson FX-80's 9 ): 11 printing characters. 
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PU PD/OP PXX 6CQK qXCDCqPXD, PP/OPXP^ U D P / 0 6 D / C ti K Pxqp PXX 
P/ XC^ Ub 6 qX6DCqpXD PU PXL/OXP /O ub PNX/P 

bxpp/qcc, PcpxxD ?\C3 XuP/fuoPcc, 6 c / ^ 0 II X 0 p . buP pp/np/oi; 
poDPu^x^r pxxDxbUDX, PXx PXc/ qxcDcqpxD c^ixbK (Dxcp/oi? pxx 

OOOXDCC^ bxDP/qCCCU D6PXXD P X 6 0 X U D / ^ U B P 6 C C (i ) D U 0 C P 

HA 32,5 5,64,64,64,64,63,0. Puc.«;c.6^ ouc^pp cc^u \c6 dd/PPxo 
oP/c./p/x^ pu quobxDP XUD/|:uoP6C. p/^pccu qxcDcqpxP^ /oPu 
bxpp/qcc. pp/[ip/[HR qxcPcqpxD^. buD 6 ^aPcx, b/t^OQh I pxcuo 

D^.PpUPOqX^ PX/^ P6D6iqD6pX P P / 0 P X P /O S 6 0 X ^ DUCPpW P 0 0 X ^ 

buop, qpxcpxp uo c o 6 & «;p/P bup P/^Pccu cop quobxppxp Pu 
pxx xP^uo b -60' ^ 5 tt pp/op/oi? qxcpcqpxp^. 

Figure 1. — The paragraph above, printed in James 
Wyatt's runes font. 

Upon starting up, the MAX-80 displays its standard 
roman character set on the screen — the usual 
green-on-black, with an inverse-video black-on-green 
character set accessible by toggling Functirn Key 4 (which 
can be de-fined for that purpose). H I wish, let us say, to 
wor^ on entering a Lao chronicle tej?t into the machine, the 
•first thing I do is to change my character set. I run a 
program written by Douglas Wyatt in "C" that loads the 
character set I have called "LAO" into memory. This 
operation retains the usual roman character set in regular 
video, but replaces the i nverse-roman set with a 
regular — video Lao character set. When this is completed, in 
about two seconds, I habitually (and unnecessarily) test the 
result by toggling the F4 key and typing a few words on the 
screen, which are displayed in Lao. I then toggle the F3 key 
to restore roman characters. I then enter my 
word— processi ng program (Perfect Writer) as normal; and, 
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when it is ready, simply toggle the F4 ley again to activc»te 
the Lao character set. That's all there is to it: Anything 
I type -from that point on is in Lao. Should I need to read 
Perfect Writer's status messages on the bottom line o-f my 
screen, I simply toggle back into roman script -for a moment, 
then return to Lao. 

And here we get to two significant deficiencies of the 
system- First, and perfectly obviously, it is not possible 
to have all possible alphabets engraved on the surface of 
every key. One either learns Lao (or Thai, or whatever) 
touch-typing very quickly, or one relies on wall -charts. I 
have found that it ma^:es good sense to maintain alphabetic 
correspondences between alphabets to tlie greatest e;:tent 
possible; thus, the "k" keys of both Thai and Lao are on the 
roman "d", the "long-a" vowel of both is on. roman "k", and 
so forth. Second, a much more serious deficiency occurs m 
t^e cases of all languages with superscr i ptss and subscripts, 
which includes all Southeast Asian scripts of Indic 
origins. It is possible to display these properly placed 
above or below the consonants, but the programming tasks 
involved seemed so immense that we have not thought it worth 
the effort. What we have, therefore, is a system in which 
letters are displayed e::actly as, and in the order as, they 
are typed. (A sample of a Lao input file is given in Figure 
2. ) 
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3 ^?JWc/ 1 5 ^ ")tJt:nt.S ^ Jl C Jjai t f \-D't:M 1 C ^ MoU:: Qof) ^f)tJ o C tJc/.f) C :J nj 1 3 ^m^^h 
V13^?JWc/C;D7j-13nwnj}5'i:irnr:n^?JCUaiTjr:fi^rl7J<WM^^ 

W:rMnS l^l^5!lo.5tJ'J ■)'^J3W^•)^^7J:Jr u-D'«Jirj^Q--Ji:riQ ' l^lUl^ l&i:3-W:iMc/)lSr.5t;rj_^)JUc^uJ^?J Cfl..! iMQ') 
d ' V ^1,^ MO ^s^M 15' 1 J] C ^IWt^Mc/ JlS:: 5= 3 ^^^^ 



This takes a great deal less getting used to than I would 
have ej:pected; and I should get ahead of myself a moment to 
note that this deficiency is a feature primarily of the 
tert-entrv str )e alone. It also means that, because 
superscripts and subscripts are mixed into the main line of 
consonants, it is difficult tn aim for, let a'.one to 
achieve, printed lines that are justified at both margins. 

Having completed entering my Lao te::t, I e::it from the 
word-processing system and go into Microsoft 3ASIC for the 
purpose, first, of converting a monolinear Lao te::t in which 
super- and subscripts are mi::ed in with the consonants into 
a multilinear text in which the super — and subscripts are 
properly placed. This is basically a simple filter program 
which takes my original Lao manuscript input file (which has 
the file-name e::tension .LtiS, for Lao ManuScript) and goes 
through it character — by-character. For each line of input 
file, it creates four lines of an output file: line 1 



Figure 2. — Sample of Lao-language input file, 
with superscripts and subscripts all included 
on the line, appro:: i matel y as it would be dis- 
played on the video screen. 
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contains all superscript tone markers, line 2 contains all 
superscript vowels, line 3 contains the on-the-line 
consonants and vowels, and line 4 contains all subscripts. 
Eacii character is placed into the appropriate line of the? 
output -file, which is given the -file-name e;;tension .LAO- 
The resulting file, if it were to be displayed, would looh 
like Figure 3. 
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M r\ 



4 t 4 t 7 



4 W 



Figure — Lao output file, showing how the input 
lines given in Figure 2 have been broken out into 
■four lines, containing tone markers, superscript vowels, 
consonants, and subscript vowels, respectively. 
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I-f I wish to proceed on to print the -file, I -first load a 
Lao character set into my printer .n this case an Epson 
FX-80, which ha£ the capability o-f accepting character sets 
downloaded -from the host computer. (We also have 
success-f ul 1 y run non-roman character sets as graphics on the 
Epson MX-80, but that is much slower and, on the whole, less 
satisfactory.) A simple BASIC program accomplishes the -font 
change- 

Finally, another BASIC program handles the tricky 
business of printing Lao, basically by making four passes 
across the page to print each line originally input. The 
first pass prints all tone markers for a line. Without 
advancing the paper, the print head then again passes over 
the same space, adding superscript vowels. The paper then 
advances before the consonants are printed below the 
superscripts, and then advances slightly again before 
printing the subscripts. The final product of this process 
comes quickly from the printer looking like Figure 4, 
reproduced at the end of this article. 

In the final printing, of course, the quality of the 
sample can be manipulated much as roman type can, using the 
emphasized, double-strike, condensed, and other print 
features of the printer. For most purposes, this should be 
more than adequate; and at the very least it often will 
allow the printing of scripts that otherwise could be 
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reproduced only in manuscript hand. 

Obviously the "universal typewriter" system described 
here is to some degree hardware-dependent; but it need not 
be. Many microcomputers can be tricked into 1 ooki ng -for 
their character sets in RAM memory, and thus can accept 
user — de-fined character sets; and even printers that cannot 
accept downloadable -fonts usuc,lly will allow the printing ot 
alternate character sets using their graphics mode. I, tor 
one, however, would be hard pressed to think o-f another 
computer printer combination that might be more congenial to 
the needs o-f those using non-roman alphabets than the Lobo 
MAX-80 and the Epson FX-SO. 



ilMslettti fox Asian and Hitfdle Ewtwn Lti^u^ges on Cofiputer. 1(1) 



ERIC 




The Universal Typewriter 



Vyatt and Oyatt 



^ o » ^ ^ 



f-v 



Figure 4- — Final printing o-f portions of the file 
shown in Figures 2 and 3, demonstrating the collapse 
of verti cal spaci ng. 
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Multilingual Word-Processing Systems: 
Desirable Features from a Linguisfs Point of Viet 
by Lloyd B, Anderson 

Ecological Linguistics 
316 A Street, S, E, 
Vashington, DC 20003 

All writing systems in a single flexible program? 

The technology is now becoming affordable so that microcomputers can in 
principle handle all written characters and writing systems in forms almost as 
good as commercial typesetting. The software programming lags behind, however, 
and each new typefont and script is still programmed individually at an enormous 
cost in time. 

It would be nice if we could have a single software program which is flexible 
enough to handle all written languages within a single word-processing 
environment, and which can easily add a new typefont or writing system which a 
user wants. What follows is one attempt to list the "module" program con^Jonents 
needed. You are encouraged to make your own additions, to use such a scheme in 
asking manufacturers how far they can meet multilingual needs, and to send your 
critiques to the author, nho will make it known when substantive updates are 
available. 

Convenience for the user: 

The user should only have to specify from the keyboard which language is desired 
and if appropriate, specify a less common type font or method of keyboard input. 
The program should automatically do the following without further instruction: 

a. Choose the proper character set (with its dot-matrix adaptations to 
particular sizes, screens, printers, and so on). 

b. /?em(Sp the keyboard Xs^ fit the new character set and the standard 
keyboard for that language. If desired^ the normal case could be to allow input 
via transliteration from Latin-letter or other keyboards, where sometimes two or 
more keystrokes will identify a single character, as sh giving Russian iL[. 

c. Specify alphabetical order (sorting order) of the characters which will 
govern all indexing programs (unless for bibliographies and library catalogs, 
where different writing systems and their transliterations are to be mixed, but 
following the single alphabetical order of the receiving language). 

d. Handle the physical arrangement of base characters and diacritics, 
accents and vowelings. This includes the order which might be left-to-right, 
right-to-left or top-to-bottom. It includes the accent marks above letters in 
European alphabets, which the program must raise to higher levels above capitals 
and letters with vertical extenders. Where this is not done, as on most 
typewriters and printers, we see less than ideal results as over the capital 
letter E following: 6 is ok, but t is not so good. 

Fuller flexibility is needed for vowels in Semitic scripts and in India and 
Southeast Asia. Here a single vowel may be represented by components to the 
left of a consonant, above, below or to the right of a consonant^ or by a 
combination of several of these at once. Thai adds tone marks as well, giving 
multiple diacritics above a single letter. 
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Letters may vary their form according to context, as in many ••conjunct" 
characters of Indie alphabets, or the linked letters of Arabic. This too should 
be automatic. 

Linear order of pronunciation, typing and code transmission may not 
correspond to linear order of print. An example of this is in Hindi, where the 
vowel /i/ is written before a cluster of consonant letters even though it is 
pronounced after them. 

All of these should be independent modular concepts. For the next generation of 
word-processing programs, there should be a standard "interface" using concepts 
like "line of text" independent of the font size or direction of writing, and 
separate concepts for number-of-characcers-per-line and nufrier-of-lines-per- 
page; using a concept "above-letter-accent" which does not specify exact 
position, and a separate concept of "extender-height" which is used to calculate 
how high to place an accent (or a set of categories for base characters of 
different heights matched to a small list of accent heights). 

The user's choice should be as simple as pressing Home or Alternate keys to 
switch between two previously designated writing systems, fonts, or the like, 
and a simple way to specify in advance just which writing system or font is 
referred to by each of such keys. 



Where are we now? 

Keyboard- /emapping programs are now available (separately, as Prokey, or 
packaged as part of larger programs). The keyboards we want are not already 
prepackaged, of course, so we must construct them. 

Large character sets and exotic alphabets are available from some special 
suppliers. Unfortunately, it is not yet generally possible to integrate these 
with off-the-shelf word-processing or database programs, partly because most of 
these are restricted to 7-bit ASCII codes and allow only 95 characters, not 
permitting the additional 96 character codes for printing characters which are 
available in an 8-bit ASCII code. 

In most programs for exotic languages, each combination of base character with 
an accent or other diacritic must be separately specified as a complete 
character. As graphics capabilities become more general (we're just on the edge 
of it now), the accent marks can be placed on a separate "layer" which is merged 
with the base character in real time on the screen or in printing, but remains 
separate in programming. Some programs already permit this. 

With more complicated alphabets and with Chinese characters, a much higher 
screen resolution is needed so they can be distinguished from each other. A 
compromise does now exist, in that printer detail can be of much higher 
resolution than what is seen on the screen under certain programs (this is true 
for printers with "graphics" capabilities). At low screen resolutions^ only a 
small part of a page could be viewed at any one time if the characters are to be 
distinguished. Screens with much higher resolution (1024 by 1024 pixels (dots) 
instead of the 320 by 200 of the IBM PC, for example) are now becoming 
available, but the software "drivers" required to operate these screens cannot 
work with most of the existing packages of software. So far, they are all 
intended mainly for people doing their own "graphics" programming, which 
excludes almost all linguists. 

Size variation is not a trivial matter. Simply multiplying each pixel by two or 
four either vertically or horizontally or both can indeed produce oversized 
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characters, but they have a "stairstep" forni reflecting the discrete dots in 
their origins. A more sophisticated "zoom" is available in some programs, which 
operates only a highly detailed space mapped inside the computer, by dropping 
out pixels as one zooms bock from the image and it becomes sfnoller (and 
correspondingly a larger fraction of the page in the computer's memory can fit 
on the screen). 

Typefont variation is extremely expensive to include at present, since each dot 
of each dot matrix must be specified and that requires a lot of memory. There 
are other methods of specifying letters, in terms of pen-strokes or brush-stroke 
shapes. Sse "Digital Typesetting" by Charles Bigelow in Scientific American, 
August 1983, especially pages 118 to 119. By factoring out separate variables 
in the type style such as stroke width or slant or presence versus absence of 
serifs, fonts can be specified in more flexible ways. This requires an 
extensive program to decode the parameters and build the characters "on the 
fly", each time they are used. Such programming has been developed for Latin 
letters by Donald Knuth in the TgX system, and has even been applied to Chinese 
characters. In this way, a larger sized character is derived from the same 
formula as the smaller one, and is equally smooth in design since the formulas 
generate a smooth sequence of dots no matter what the character size. 

Typesetting is also on the edge of becoming cheaply and widely available, and a 
number of programs exist for this. Some are very sophisticated, such as Donald 
Knuth 's system mentioned above, but do not permit interactive editing. 

The above notes are intended as an aid to help you when inquiring about what a 
program can do for you. 
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Reviews of Organizations, Books, Journals and Articles 



In this section, wg will regularly review all resources of interest except 
hardware and software, which will be covered in another section. If you know of 
any books, journals or articles which would be useful to others working with 
Asian and Middle Eastern languages on computers, please send information about 
them to the editor. If it's an article, a copy of it would be appreciated. We 
also want to mention any or^ nizations of interest, which is where we'll begin 
this issue's coluinn: 



Organizations 



There are several organizations which will be of interest to at least some 
readers. The Association for Literary and Linguistic Coiputing has existed 
for well over a decade now. Its primary aims are to "further literary and 
linguistic research by computer and to provide a means of communication for all 
those concerned with such research". The ALLC has two publications: the first 
is the ALLC Journal, a biannual refereed journal carrying articles on the 
results of literary and linguistic computing. The second is the ALLC Bulletin, 
published three times a year. It is less- formal and carries articles on 
methodology, software and work in progress. Membership for individuals in the 
ALLC costs &10 and includes both the Journal and the Bulletin. For further 
information contact: Dr. T. Corns, University College of Korth Wales, Bangor, 
Gwynedd, LL57 2DG, United Kingdom. 

While most of their members work with European languages, there are two 
specialist groups of interest to our readers. One specializes in Hebrew and is 
managed by Professor Y.T.Radday of the Technion in Haifa, Israel. The other, 
formed at the last annual conference of the ALLC, specializes in Languages and 
Literatures of the Indian Subcontinent. It is chaired by Dr. Peter Schriener of 
the Seminar fiir Indologie und Vergleichende Religionswissenschaft, Universitat 
Tubingen, Miinzgasse 30, 7400 Tubingen, West Germany. Please see the 
Announcements Section in this issue regarding the status of this group. The 
ALLC will of interest to anyone involved with using computers in a scholarly 
context. It has a broad international base of members, with many from Europe. 



The Association for Computers and the Kuaanities was organized to encourage 
"computer aided research in language and literary studies, history, anthropology 
and related social sciences as well as the use of computers in the creation and 
study of art, music and dance. In addition, the Association fosters the 
development of computer technology and techniques necessary for these studies". 
Benefits of membership include a quarterly newsletter covering activities of the 
Association and other topics, the option of a reduced subscription rate to the 
journal Coiputers and the Huianities and reduced registration fees for 
meetings of the ALLC and the International Conference of Computers and the 
Humanities which the ACH organizes. Annual membership costs $15. For more 
information, contac^t: Harry Lincoln, Association for Computers and the 
Humanities, Mutic Department, SUNY Binghamton, MY 13901. The ACH, like the 
ALLC, will be of interest to anyone using computers in a scholarly context. 
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Its membGrship is predominatGly American, but there are members from the rest oA 
the world as well . 



The Chinese Language Coiputer Society was formed in 1976 with the goal of 
advancing "the science and technology of information processing in Chinese and 
other languages containing a non-trivial Chinese component" and promoting "the 
exchange of information on Chinese language information processing in the best 
scientific and professional tradition". The languages that members of the CLCS 
are primarily interested in are Chinese, Japanese and Korean. This society has 
a computer science rather than humanities orientation. The benefits of the 
CLCS are a quarterly journal entitled Coiputer Processing of Chinese and 
Oriental Languages, a newsletter and international conferences (papers 
presented at them are usually available in Proceedings). Annual membership 
costs $15. For more information, contact Professor Helena Gin Wong, CLCS 
Membership Chairman, Computer Science Dept., San Francisco State University, 
1600 Holloway Ave., San Francisco, CA 94132 or telephone (415)469-2858 or 
469-1008. The CLCS is oriented towards computer scientists and programmers, 
but there are articles published in their journal that can be read by anyone 
with a basic familiarity with computers. 



The Coiputer Society of India is the national computer society in India and 
corresponds to the Association for Computing Machinery in the United States. 
Benefits- include a newsletter, the CSI CoMunications, published monthly and a 
journal, Coiputer Science and Infonatics, published twice a year. Both are 
published in English. Membership inside India costs Rs. 60/- and outside India 
it costs $30 (which includes the cost of airmailing all publications). For 
further information, contact the Computer Society of India, Institution of 
Engineers Building, 15 Haji Ali Park, Bombay 400 034, India. The CSI is a 
professional society oriented towards programmers and computer scientists. 
There are only occasional articles on Indian languages on computer. 



The Association for Coiputational Linguistics was founded for persons 
interested in using computers for language research (including phonetics, 
phonology, syntax, semantics, discourse, ...), applications (including 
translation, office automation, lexicography, ...) and scholarly investigations 
(including stylistics, content analysis, ...). Benefits include a newsletter and 
a journal, Coaputational Linguistics, published quarterly, and an annual 
meeting. Annual membership costs $20 (I think). For more information, contact 
Donald E.Walker, ACL, Bell Communications Research, 445 South Street, 
Morristown, NJ 07960 or telephone (201)582-7406. The ACL is a specialist 
organization that seems to appeal primarily to computer scientists, especially 
those working with natural languages and artificial intelligence. The articles 
in their journal are sophisticated and can generally only be read by advanced 
linguists and computer scientists. 



McHslcttcr for Asian and Hiddle Eastern Lanfluages on Conputcx, 1(1) 



Page 



Books 



Not very surprisingly, there are few books on Asian and Middle Eastern languages 
on computer. There are several books that are not directly relevant, but still 
useful. If you know of any others^ please let the editor know of them. 



A Guide to Conputer Applications in the ttm nities, by Susan Hockey (Johns 
HopKins University Press, Baltimore, 1980) is an excellent introductory work 
that covers much of what has been done in the last two decades in literary and 
linguistic computing. There ar& only a few references to any work with 
non-western scripts, but on the other hand it does cover a large range of which 
sorts of things have been done with European languages. The chapter titles are: 
Introducing Computers; Input and Output; Word Indexes, Concordances and 
Dictionaries; Vocabulary Studies, Collocations and Dialectology; Morphological 
and Syntactic Analysis> Machine Translation; Stylistic Analysis and Authorship 
Studies; Textual Criticism; Sound Patterns; Indexing, Cataloguing and 
Information Retrieval; and How to Start a Project. Also included is a useful 
bibliography. There have been more recent developments that make some of the 
material a little dated, but it is an excellent place to start for anyone 
interested in or involved with literary and linguistic compu.ting. 



Text Processing: Algorithms, Languages and Applications, by Allen B. 
Tucker, Jr. (Academic Press, New York, 1979) is a somewhat dated but still 
useful introductory work on literary and linguistic computing. The chapter 
titles are: Introduction to Text Processing; Introduction to PL/I for Text 
Processing; Introduction to SNOBOL for Text Processing; Overview of Text 
Processing Packages and Applications; and Literature Review, This book would be 
most helpful to some >ne who is interested in literary and linguistic computing, 
but does not know much about computers. The articles and books cited were all 
published before 1979. 

TeI and METAFONT: Mew Directions in Typesetting, by Donald Knuth (Digital 
Press, Bedford, Mass., 1979) is already a classic. It describes a typesetting 
system (called TeX) and typeface designing system (called METAFONT), Both were 
conceived of, designed and (primarily) implemented by Donald Knuth, who has also 
been writing the series The Art of Coiputer Prograaaiing. Both systems are 
batch oriented and probably cannot be made interactive (some claim for good 
reason too). Even so, there are many concepts in these systems which have 
changed the way most people look at typesetting and typeface design. This book 
explains the concepts behind the two systems. TeX has been used to typeset well 
over a dozen books and is used to typeset lots of mathematics, one of the most 
difficult tasks you can give a typesetter. Some work has been done with Indian 
languages and Chinese, but much remains to be done to make TeX usable for Asian 
or Middle Eastern languages. 

Anotyii>r recent book about these systems, also by Donald Knuth, is The TElbook 
(Addison-Wesley, Reading, Mass., 1984). This book explains how to use TeX for 
both beginners and advanced users. It is easy to read and is even humorous in 
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parts. I should add that TeX and METAFONT are in the public domain. For more 
information contact the TeX Users Group, c/o American Mathematical Society, PO 
Box 1571, Annex Station, Providence, RI 02901. The TeX system is now quite 
common on university computer systems in the United States and Europe, 
especially those with strong mathematics departments. The code for the TeX 
system is available for a nominal fee from the TgX Users Group. 

Coded Character Sets, History and Developient, by Charles Mackenzie 
(Addison-Wesley, Reading, Mass., 1980) is (contrary to its title) primarily just 
a history' of the IBM EBCDIC (Extended Binary Coded Decimal Interchange Code) and 
the ASCII (American Standard Code for Information Interchange) character codes 
used to represent characters in computers. It does not cover any other 
character codes at all, nor are there any references to other books or articles. 
This is frustratiiig because there are so few in existence. It does explain in 
great detail why the EBCDIC and ASCII codes are structured the way they are and 
some of the problems one runs into when converting from one to the other. It 
would be useful to anyone designing computer character sets as a source of 
background information. 



Word Processing Handbook, by Ivan Flores (Van Nostrand Reinhold, New York, 
1983) describes in great detail the components of a word processing system, both 
software and hardware, the various features included in almost all word 
processors and a chapter on how to evaluate word processing (WP) systems. This 
book would be useful to anyone designing or implementing a word processirig 
system because it lists and explains the features of a WP system in more detail 
than I've ever seen anywhere else. Features are described as a user would see 
them, not from the viewpoint of a programmer or designer. Features of the mof^t 
popular word processors are described too, giving a feel for what is popular 
(and presumably comfortable) for most people. There is no discussion of any 
languages other than English and no references to any other books or articles. 



Journals 



Several journals have been described in the section on organizations. Their 
names once again are: 

- ALLC Bulletin and ALLC Journal, both published by the Association for 

Literary and Linguistic Computing 

- Computer Processing of Chinese and Oriental Languages, published by 

the Chinese Language Computer Society 
" Coiiunications of the Coiputer Society of India and Coiputer 

Science and Informatics, both published by the Computer Society 
of India 

- Coiputational Linguistics (formerly American Journal of 

Computational Linguistics 

In addition to these journals, there are several others of interest. 

Computers and the Humanities has been published for over fifteen years now. 
While many of the articles are only of peripheral interest, there are still 
several articles published each year that directly concern Asian or Middle 
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Eastern languages on computer. Many of the other articles will be of interest 
to anyone doing literary or linguistic computing. In addition to articles, they 
also regularly review books and software as well as publish a list of scholars 
active in the field. By joining the Association for Computing ir the 
Humanities, you can receive a discouxited subscription to it. For jiore 
information about the journal, contact the publishers: Paradigm Press, Post 
Office Box 1057, Osprey, FL 33559, or the ACH. 

SCOPE (Scholarly CoMunication: Online Publishing and Education) is a 
newsletter which is also published by Paradigm Press. It comes out every two 
months and carries news on new books and journals of interest to humanities and 
social science users of computers. Topics covered include interesting hardware, 
software and databases and upcoming mootings. It complomonts CtM rathor 
nicely. Subscriptions are $47 per year and ^ reduced rate is available to 
members of the Association for Computing and the HuroardtiGS. For more 
information, contact: SCOPE, Paradigm Press, Post Office Box 1057, Osprey, FL 
33559, or the ACH. 

Technical Japanese Translation is a unique newsletter published by Dr. Dojiald 
Philippi. Its primary audience is technical translators of Japanese, but anyone 
interested in Japanese on computers should also subscribe to it. It covers 
developments using the Japanese language on software and hardware from a user's 
point of view in addition to topics primarily of interest to technical 
translators such as word lists, reviews of dictionaries and other books. A six 
month subscription costs $20 in the US. Subscribers in other countries should 
contact the editor. The first year's back issues (numbers 1 through 15) cost 
$20 in the US and $25 for Japan and Europe. For further information, contact 
the editor: Donald Philippi, 715 Tenth Ave., San Francisco, CA 94118. His 
telephone number is (415)752-7735. 

The Seybold Report on Word Processing is a specialized and expensive 
newsletter that reviews word processing systems in great and exhaustive detail. 
So far as I know, they have not reviewed many WP systems that work with Asian or 
Middle Eastern languages, but the reviews are the best I have ever seen. If 
anyone is contemplating buying, using or designing a system or piece of software 
for word processing, this newsletter cannot be too highly recommended. There 
are several other Reports of potential interest, including the i'eybold Report 
on Office Systeis ($60/year) and the Seybold Report on Profest\ional 
Coiputing ($120/year). For more information, contact: Seybold Publications, 
Box 644, Media, PA 19053. Their telephone number is (215)565-2480. 

The newsletter of the Society for the Study of the Indigenous Languages of 
the Americas has a regular column for computer users. While the particulars 
may not be directly applicable to Asian or Middle Eastern languages, there are ' 
topics discussed that are still quite useful. For more information, contact the 
editor: Victor Golla, Department of Anthropology, George Washington University, 
Washington, D.C. 20052. 

The Siall CoMputers in Libraries Newsletter has occasional articles that deal 
with Middle Eastern (and hopefully Asian) languages in libraries. A one year 
subscription costs $20 in the USA, $25 in the rest of North America and $35 in 
the rest of the world. For more information, contact: SCIL, Graduate Library 
School, College of Education, University of Arizona, 1515 E. First Street, 
Tucson, AZ 85721. They can be reached by telephone at (602)621-3566. 
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Articles 



In the course of tracking down people who are working with Asian and Middle 
Eastern languages, several people have been kind enough to send »e articles of 
interest. If you have written any articles of interest to the readers of this 
newsletter, please send a copy of the article to the editor (or at least a 
'Reference to it) so that others can be told of it. 

James E. Agenbroad^ "Character Sets: Current Status and East Asian Prospects", 
Journal of Library Autoiation 13(1): 18-35. March 1980. Recent standards for 
East Asian character sets are described here, along with a description of their 
potential usability in libraries. Character sets for English (ASCII and 
others), Cyrillic and Greek are also illustrated. 

Mohammed H. Aroan, "Use of Arabic in Computerized Information Interchange'*, 
Journal of the American Society for Inforiation Science 35(4): 204-210. 
(1984). This article discusses the problems of putting Arabic on computer and 
the new Arabic character standard (CODAR-UFD). A list of vendors of Arabic 
terminals and computers is also given. 

Joseph D. Becker, "Multilingual Vord Processing", Scientific American 251(1): 
96-107, (July 1984). The problems of developing a word processor that will work 
with most of the world's languages are described here. The article clearly 
describes why a truly multilingual word processor is so difficult to implement. 

R. W. Bemer, "Inside ASCII - Parts I, II and III", Interface Age, May 19Y8: 
pp. 96-102, June 1978: pp. 64-74 and July 1978: pp. 80-87. This series of three 
articles in a popular computer magazine explains the details of the ASCII 
character set, which has been slightly modified into International Standard 646 
This character set is the basis of all national character sets and is therefore 
of interest to everyone. These articles, by one of the people involved in 
designing several character sets, also describe character sets for Cyrillic, 
Greek and Japanese. 

Winand M. Callevaert, "The Leuven Text-Composition System in Devanagari", 
Computer Science and Informatics, 13(2): 23-30. (1984). This article 
describes a system used for Hindi, Sanskrit and Gurmukhi. The keyboards and 
character codes used for each of the languages are described. 

Y. Choueka, H. Cohen, J. Dueck, A.S. Fraenkel and H. Slae, "Full Text Document 
Retrieval: Hebrew Legal Texts", Proceedings of the Symposium on Information 
Storage and Retrieval, April 1-2, 1971, pp 61-79. (Another ACM publication). 
An early paper describing the initial phase of the Responsa project. This 
project aims to store and make easily available an enormous quantity of Hebrew 
and Aramaic texts. The problems of dealing with Hebrew are well described. 

International Business Machines Corporation, "IBM 5291 Display Station: National 
Language Requirements (Thai) Operator's Guide: RPQ 8D0025", CA09-1665-0 (File 
Ho. S5250-14). This document describes the keyboard layout and graphics for a 
Thai version of their 5291 terminal. 

J. B. Millar and M.C. Newey, "An Arbitrary Font Printing System", Proceedings 
of the Digital Equipment Computer Users Society, Tovnsville, Queensland, 
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Australia. (August 1977). This article describes a system that can be used to 
print in arbitrary fonts, including Devanagari. The font editor is also 
described. 

J. B. Millar and W.W. Glover, "Synthesis of the Devanagari Orthography", 
International Journal of Han-Hachine Studies, 14:423-435 (3981). This 
article describes an application of the arbitrary font printing system described 
in the article previously »entioned. The system was used to prepare material in 
Nepali. The system implementation and algorithm for synthesizing characters are 
described. 

J. S. Millar and H. Oasa» "Proposal for ASCII coded phonetic script". Journal 
of the International Phonetic Association. 11:62-74. (1981). Thic article 
proposes a character set for the International phonetic alphabet. 

Ichiko Horita, "Japanese Character Input; Its State and Problems", Journal of 
Library Automation 14(1); 6-23. March 1981. Descriptions of most, if not all 
methods for entering Japanese into a computer ana ,heir advantages and 
disadvantages are given. Problems of working witii kanji inside a computer are 
also described. 

R.H.K. Sinha and Arjun Raman, "A Modular Data Terminal for Indian Languages", 
Computer Graphics 14(1&2): 39-72. July 1980. A detailed design of a terminal 
capable of working with all of the Indian languages is described. Different 
methods of encoding the languages are described as well. Computer Graphics is 
a publication of the Special Interest Group in Graphics of the Association for 
Computing Machinery (ACM), the national computer society of the USA. 

There are short articles of some interest occasionally seen in Infovorld, a 
weekly magazine that covers the computer business and new products. Some recent 
articles along these lines included: "Japan on 15K a Day", pp. 66-58 (28 May 
1384) and "Saudi Arabia Embraces Micros", pp. 25-26 (23 April 1984). The 
publisher's address is: Infoworld, Circulation Department, 375 Cochituate Road, 
PO Box 837, Framingham, MA 01701. 

As this issue was being completed, a specia\ issue of Computer^ a monthly 
magazine produced by the Computer Society of the IEEE (Institute of Electrical 
and Electronic Engineers), has just come out on Chinese/Kanji Text and Data 
Processing. There are eight articles on this topic, including an article by 
Joseph Becker who wrote the Scientific American article mentioned above. 
Anyone interested in Chinese or Japanese on computer should get a copy of this 
issue. 
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Calendar of Events 



February 25-28, 1985 - International Conference of the Chinese Language 
Computer Society will be held in San Francisco, California at the Golden Gate 
Holiday Inn. For further information, contact: Dr. Daniel T. Chang, Local 
Arrangements Co-Chairman, A4S/098, IBM Corp., 1126 Olive Branch Court, San Jose, 
CA 95120. (408)268-7873. 



June 22-24. 1985 - Fifth International Conference on Databases in the 
Humanities and Social Sciences will be held at Grinnell College. Hajor 
topics will be opportunities for database users provided by microconputers, 
videodiscs, communication networks, online catalogs, and electronic bulletin 
boards. The registration fee is $125. For further information, contact: F, 
Moberg, ICDBHSS/85 Coordinator, Grinnell College, PO Box 805, Grinnell, lA 
50112. (515)235-2570, 



June 26-28, 1985 - 1985 International Conference on Coi«puters and the 
Humanities will be held in Prove, Utah at Brigham Young University. For 
further informatioa contact: Randall Jones, Humanities Research Center, 3050 
JKHB, Brigham Young University, Prove, UT 84602. 
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Announcements and Queries 



Dr. Peter Schreiner is the chair of a new specialist interest group in 
Languages and Literatures of the Indian Subcontinent, which is one of many 
groups of the Association for Literary and Linguistic Conputing. He has 
written that "during the initial phase I see my task mainly as that of gathering 
information about people, institutions, projects, etc., to be exchanged and 
handed on whenever I am approached. 

"Such information may concern computer-assisted work on any Indian larguage 
or text of any period CVedic, classical Sanskrit, Prakrit, Pali, modern 
languages, dialects, oral tradition, etc.), including standards, routines, 
programs, etc. for the transliteration of Indian scripts and/or the handling and 
output of texts in original scripts (e.g. Devanagari); and such work may be of 
any type (lexicography, concordances, language teaching, bibliography, etc.). 

"If you are working in this field, kindly let sne know the following details 
concerning your project: 

- Names and addresses of people involved 
Title and brief description of the project 

- List of publications (published and forthcoming) based on the project 

- Information on hardware and software used 

- Lists of texts available in machine-readable form indicating whether 
they are available for outsiders and under which conditions. 

"Do not hesitate to comment on what you expect from this specialist group or 
to suggest how it should function - we are just beginning tha work! Thank you!" 

Dr. Schreiner can be contacted at the the following address: 

Seminar fiir Indologie und Vergleichende Religionswissenschaft 
Universitat Tubingen 
Miinzgasse 30 
7400 Tubingen 
West Germany. 

Maurice J. Bauhahn is interested in hearing from anyone working with Khier or 
Thai on coHputers. He can be contacted at the following address: 

Maur. T Bauhahn 

c/o Voi id Vision Foundation of Thailand 
G.P.O. Box 1717 
Bangkok 10500 
Thailand 
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Policies 



The Newsletter for Asian and Middle Eastern Languages on Computer is published by Bear River 
Systems. The entire contents of this Newsletter are copyright (c) 1985 by Bear River Systems. 
For rights to reproduce the material in this newsletter in any form or media, please contact the 
editor. 

A one year subscription to the newsletter costs $10 per year throughout the world and always 
includes the entire volume. Subscriptions from the USA help subsidize subscriptions to the rest of 
the world. Issues are mailed first class in the USA and airmail to the rest of the world. To order a 
subscription, please send a check or money order for $10 (in US funds) payable to "Bear River 
Systems" to the following address: 

Bear River Systems 
POBox 1021 

Berkeley, CA 94701 USA 

Telephone: (415)644-1738 

The newsletter is produced on a part-time basis and there is net sufficient time to deal with various 
forms of red tape. Please therefore send a check or money order, rather than a purchase order. If 
there is no other way to order the nev/sletter than through a purchase oider, then add $5 to cover 
the additional work required to process the paperwork. 

Exchanges with other newsletters will be considered Flease contact thr editor. 

This newsletter was produced using an Apple Lisa (aka Mac XL), MacWrite, Microsoft Word and 
the Apple LaserWriter. 



Submitting Articles 

Tc submit an article for publication, please send one copy of the article in "camera-ieady" form to 
*he editor at the following address: 

Editor, Newsletter for Asian and Middle Eastern Languages on Computer 
Center for South and Southeast Asian Studies 
260 Stephens Hall 
University of California 
Berkeley, CA 94720 USA 

Electronic submission of articles is encouraged Please contact the editor regarding this. 
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Editor's Page 



Welcome to the second issue of the Newsletter for Asian and Middle Eastern Languages on 
Computer. In this issue we have two articles: Jay Rubin extends the procedures described by 
Tony Stewart in the previous issue to SSI's WordPerfect and Microsoft's Word. Both are well 
known word processing programs for the IBM PC and compatibles. George Hart has written an 
article about his work developing Indian fonts for the Apple Macintosh computer. 

In this issue, I have begun listing software and hardware of potential interest to the readers of this 
newsletter. These are simply listings of information provided by the manufacturer or from some 
other source. They are not reviews. In general, I do not have access to enough machines to try 
many of these packages, nor the time. If some of you have used any of these or other products 
many others would be interested in hearing of your experiences. A review could be as short as a 
paragraph or as long as several pages. Send your reviews to the editor at the address given on the 
inside cover. I will publish articles written by developers of haidware and software products and 
they will be labeled as such. If you are interested in writing such an article, please contact me. 

All reviews and product listings were written by the editor, 

A number of people greatly helped in getting this issue of the Newsletter out. I would like to thank 
Barbara Guerlin and Bruce Pray of the Center for South and Southeast Asian Studies at the 
University of California, James Agenbroad of the Library of Congress, Tames Nye of the 
University of Chicago and Ken Logan of the South and Southeast Asian Library at UC Berkeley 
for helping with ideas, and information about articles and books. George Hart of the Department 
of South and Southeast Asian Studies at UC Berkeley has been very helpful in many ways 
including sharmg ideas and equipment, such as the LaserWriter that this newsletter was printed on' 
Many th^s alsoio Tony Stewart at the University of Chicago and Maurice Bauhahn working for 
the World Vision Foundation in Thailand for letting many olher people know about this newsletter 
LasUy, I would like to thank my wife Diana for all her assistance in all aspects of publishin<' this 
newsletter. r- r e, 

This issue has come much later than I thought it would. My apologies to one and all. Issues 3 and 
4 for this year will be combined into a large issue which I hope to be sending out before the end of 
the ye^. If you are working on an article for this newsletter, please try and get it to me before 
December 1 so that it can be edited in time. 

Anthony Meadow, Editor 
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About the Contributors 

Jay Rubin is a professor of Japanese literature at the University of Washington. He has been 
processing words (specifically, the words of Natsume Soseki's 1908 novel Kofu me 
Miner ) since May 15, 1985 and can no longer believe that it was possible fr^m tifsnskte 
Soseki s Sanshiro (Univereity of Washington Press, 1967) or to publish a study of orewar 
&roX fi te^ to Public- Mor,1rWnt en .nd th, M^eiji e%'Llr^ gTof 
wasiiington Press, 1984) oy wnting m longhand and typing the manuscrlDt on the nortable 
gpewnter he received as a high school graduation present On the other hS, he had sS a^I 
his time wntmg instead of making macrons on computers, he probablyTouId Lve a?^^^^^^^ 
bTp^'nfat^m^^^^^^^^ ''''''' '''' ^ '''' BatwT9800T<!? 

George Hart is a professor in the Department for South and Southeast Asian «;tiiH,v<: r,f 
University of California at Berkeley. He has several books to Hs S 4 c^ be Jfached 
cSi?or:SeS;,?r^^^^^^^^^ 1203 Dwinelle^SuS^^'ol 
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Correspondence 



Center for Southeast Asian Studies 
Northern Illinois University 
DeKalb, IL 60115 

Dear Sir, 

We have read with interest the article by David and Douglas Wyatt on the production of 
Southeast Asian scripts using their "universal typewriter." Your readers may be interested to know 
that we have tl 3Ught the "immense" programming effort involved in more direct conversion of the 
romanisation t Southeast Asian otiiographies worth the effort required. 

To date, we have developed precise, pronounceable romanizations and computer conversion 
programs for Tai Dam and standard Thai. The system is broadly generalizable to other Southeast 
Asian languages (Lao, Khmer, Burmese) and would probably require less than a month's effort 
for each additional language. Romanized, pronounceable text can be entered at the computer for 
immediate conversion and then be displayed on the computer's screen, printed on a dot matrix 
printer, or written as a bmary image for later instantaneous screen display. Alternately, the text can 
be stored for later editing or conversion. 

Tlie software is written in Pascal, and runs on an Apple II computer with 64K or more 
Pnntmg may be done on an Apple DMP or Imagewriter. Modification of the software to drive 
other 8-pin dot matrix printers would not be difficult An interactive character set editor has also be 
wntten to aid in the design of new character sets. 

This system has been used to transcribe and print the Thai novel, "Red Bamboo" by Khukrit 
Pramot, and is also being used as a component of a computer-aided intruction project for Thai. 
Both native Thai speakers and American students have learned and used the romanization system. " 

Further details, program source code and documentation, and a disk containing the basic 
programs for Thai transcription are available from us through the Center for Southeast Asian 
Studies at Nothem lUinois University at nominal cost ($10 to cover production) for those interested 
m using and inodifying the system on a non-commercial basis. A paper, "A Generalized 
Transliteration System for Southeast Asian Languages", describing the approach used in this 
system will appear in Seven Centuries Of Thai Writing to be published by The Center for 
Southeast Asian Studies in the fall of 1985. 

Yours sincerely, 

G. Henry (Computer Science, NIU) and 
J. Hartmann (Foreign Languages and Literature, NIU) 
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A Ntte on the Production of Macrons in Transliterated Japanese 

by Jay Rubin 
University of Washington 

^;o7v"^,^* a thorough job of setting out the problems involved in making 

diacnUcal marks in his "Diacritics on Wordstar: South Asian Language Transliteration wSt 
Customized Software" (Volume 1, number 1 of this Newsletter), tLt There would be no voS 
mtemung them here. Since he specifically mentioned that he had not had the opportuni^^^^^ 

^I^ IT' ^ ^^^"^ '° deal with diacritics, and since Too fhe 

methods I have developed seem to solve all the problems that remained with WordStar and would 
cemin y be adaptable to South Asian languages, the results of my limited experience should be of 
S H° '"^^^""2 "^"'^'^"^ °^ °'her such marks in wo?d processing They 

rSLn.. . T^'tiV ^'^^ '^"^^ ^^'her not yet purchased hardware^ or by 
coincidence, have bought hardware like mine. (I use an IBM PC compatible Compaq and a 
Brother daisy-wheel pnnter.) Even with these limitations, the technical data would be valuaWe to 
WordS^-.^°" °'her techniques for creating diacritics. In any case, as TonrStew^t noted with 
WordStar, it is essential to have a printer that will perform backspacing andluoerscn^ts 
prefetaMy one that can respond to codes adjusting the degree of its supers^cripjf ubscrirmoSnt 
tt\.^ T^""?'^ ^"'^'^ increments. I have also included the results of mrefforts 

they'^f^Xr^^^^^^^^^^^^ '° ™^ ^ ^-s satisS^ 



SSI's WordPerfect 



rnJ^Sr V^M^'"°" Softv^arc, Orem, Utah) runs on the IBM PC and 

IBM PC compatible computers such ^.s Compaq and Zenith. There are versions for other 
computers as well. The program contains both the backspace and superscript fSoS al owine 

o'^kSs'aL^^^^ ''''4' "'^"^ ^P^"^' ^l^ough'qSS::' nSmbe? 
or Keystrokes are required. WordPerfect also comes with a so-ca ed macro facility which allows 
he user to save and recdl virtually any series of keystrokes with the push of two k?ySeif 
Lis J° f°"r second de ay while the computer recalls the macro from disk). By saving the 
keystrokes for "o", overstrike, superscript, and "-", aiid naming this combination "Alt-O'^ onf can 
?lnr' ^''.f ^ Ume simply by pressing the Alt key along with the 0-kerwh1 h is'no more 
SSfo T'''-''' a typewriter. (One can jus't as easily save ^h^ 
Keysd-okes for kuzen no daisogi" and recall them with Alt-K, say.) Since WordPerfect 
IS a clean screen" word processor, the overstrike and superscript cies do nTappe^n the 
monitor unless 'for ed t ng numncpo thp p^„^oi n^^A^. -.J.-. L.j " ' appear on tne 



until the "coS^^al^e^^^^^^^^^^ Ur^utttr,^^^^ 

moves a character up only one-third line, which brings the hyphen uncomfo^bly cbse to^^^^^^^^ 

"Ad^^?e uX?tfr? 'wf'^^'' ^"'^ "PP^^-'^^^^ The progSo fndude^^^^^ 

*hZ^Z^^ feature which moves characters up a full half Une and an "Advance Down" to bring 

f/lSen'rr^^^^^^^^^ "P" ~ 

read^sYtt-fe^^^^^^^^^^^^^ 

^:^l^^^^lVrogram^^ gtn^rfzt6 instructions into codes for thelser's partiS nrfnter SSI's 
excellent documentation and telephone support make this a straightforward procedure one that 
fn?S"° ^"PP^JP^^t^^ software. (Tht document PRINTCR.M.&f mentSned oCg^5 of fhe 
Installation pamphlet and contained on Jie Supplementary diskette is a 33 page technical guSe to 
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pnnter dnvers and is well worth printing out.) Although the initial process is somewhat 
complicated, once you have created the character- with-macrcn combinations, they become a 
permanent part of the character font^ in the word processing program and can be used at any time 
as easily as making a capital letter, without the delay involved in using a macro; they appear on the 
screen as quickly as any letter you type, and they are perfectly legible, having no distracting printer 
codes or symbols surrounding them, and no characters "hidden" by the backspace (or "overstrike") 
function. The steps for creating "o"and "D ", using the specific example of a Brother daisy wheel 
pnnter (models HR- 15, -25, or -55) are as follows : 

1. From the Set-up menu, as described in the Installation pamphlet, choose option 5, "Define 
Alt and Ctrl key mapping", and defme Ctrl-0 as decimal code 147 ( 6 ), which is a pretty 
close equivalent to have on the screen. From now on, typing CtrI-0 will cause circumflex- 

0 to be displayed, and ' S o s e k i " will appear as S 6 s e k i ". Exit WordPerfect. 

2. Run the Printer Program, also following the instructions in the Installiition pamphlet. 
Choose option 3, "Printer definition (Examine, Change)". From the "Printers Currently 
Defined" menu, choose option A, "Create". (Rather than risk changing anything in the 
given definition for your printer, you create a new definition modeled on it and play with 
that) Enter any name you like for this new printer definition. Then designate which of the 
thuty-one current printer defmitions you will use as a pattern: presumably, yours is among 
them. Select option 5, "Subscript/Superscript/Underline/Bold". Record the given codes 
for "Superscript on" and "Superscript off, but don't change them. Select item 0, returning 
you to the "Create" menu. Again select 0, returning you to "Printers Currently Defined" 
Exit. 

3. From the main menu-of the Printer Program, choose the character editing option (Number 
4), and when you find 6 in your particular character table (using the direction keys to 
scroU), replace the given string sent to the printer (probably "o<8>'^", meaning the letter o 
followed by the character which has the decimal value of 8 (this is the standard backspace 
code) + a caret (shift 6 on most keyboards)) with a new string composed of o + backspace 
+ a modified version of the "Superscript on" code + hyphen + "Superscript off* code. If 
the given "Superscript on" code does place the hyphen high enough to clear the letter, there 
IS no need to modify it, but this can be done, as follows, on a Brother printer That 
printers code for "Superscript on" is <27><30><3><27><10>2, but changing th-3 <3> to 
<4> actually raises the hyphen just enough so that it leaves a niue space between the letter 
anJ hyphen. In sum, then, the string sent to a Brother printer to produce o is this- o<8 
><27><30><4><27><10>-<10><B>. Of course ii can be produced by replacing the 
given pnnter stnng for Q with u<8><27><30><4>, etcetera. This will not work for 

1 , as explained in step 4. 

4. The <4> in the string sent to the printer is not high enough to clear upper-case letters or "i" 
^^i^-^\W°^lf^ ^^"^P^y "^^"S <5>. The siring sent to the printer for O is 

'<«><27><30><5><27><10>'<10><B>, but because no standard code exists for upper- 
Caws O with a circumflex, we are left with die problem of finding an appropriate character 
for screen display. One choice in this case might be the Greek letter theta, but the oval face 
produced by decimal 002 code seems the right size and shape - and is fun. In this case, 
AIt-0 rather than Ctrl-O is "mapped" in step 1 to display the oval face on the screen smce 
Ctri-0 was taken by 6. 

^' ?c WordPerfect hit the "Print" key; choose option 4, "Printer Control"; then option 3, 
5elect Pnnters ; and designate your newly defined printer as printer number 1. 

1 Actually, you are allowed to define up to six printers, each with eight foists, so you can easily 
switch off these "pennanent" characters whenever you like. If you want to write in French, you 
will not have sacrificed your circumflex-c by following the instructions below. 

2 See Appendix for other printers and codes. 
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Once we have completed the five steps, the 0-key combmed with Ctrl produces on the screen 6 
but prints o , and combined with Alt it displays the oval face vhUe it prints ' 0 . Since the macron is 
now thought by the printer to be part of the character, it never causes the printer to place 
underlining anywhere but where it belongs: under the character. And since the printer codes for 
backspace and superscript are now "built in" to the definition of the character, there is no delay 
whUe the computer retrieves a string of codes from disk: the process is instantaneous and not only 
far simpler than on any typewriter, it is far more refined, macron heights being custom-suited to 
each character. This adjustability of the degree of movement in supers cripting was described by 
Tony Stew^as an exclusive feature of WordStar (and indeed, there is no mention of the capabiUty 
in the WordPerfect documentation), but obviously that is not the case, and it can be especially 
useful to people m South Asian fields, who would want to modify both the "Superscript on" and 
Subscnpt on codes in this way to produce diacritics both above and below characters' a far more 
complicated task than that faced by us in Japanese fields. 



Microsoft's Word 

Word (Version 2.0, 1985, by Microsoft Corporation, Bellevue, Washington) is another IBM 
word processing program, and it, too, comes in other versions. The manufacturer includes a 14 
?Rf™'^!;?M"^ on the "Utilities Disk" called C0NVPRD.DOC which, like WordPerfect's 
fS. iit^'^''^ PT"'^^ ^""^ "'^"^ f ^ ^^^^"^^^^ guide to the program's printer drivers. 
Unfortunately, however, this H.ocument reveals only that Word a far less adaptable program than 
WordPerfect when it comes to unusual tasks such as placing macrons over vowell While 
displaying a circumflex-o on the screen is a relatively easy task (one need only look up the code 
147, in Appendix A, then type the code on the number pad while depressing the Alt key; no 
mapping of keys is necessary - or possible), the only thing that will be produced by this on 
paper IS circumflex-o. Word s printer drivers are not open and accessible like WordPerfect's and 
do no permi the user to substitute arbitrary strings of printer codes for the ones provided 
Overstriking is the sole function of the drivers' "Character Translation Tables"; they canSi 
accomodate superscnpting. Users of Word, then, are restricted to whatever can be accomplished 
frorn the keyboard. Since Word makes quite a lot accessible from the keyboard, this is not a<J 
restncting as it may sound, but the ease, speed, and typographical refinements are not available'' 
1 he underlming of text contaimng macrons is also a serious problem, as outlined below 

One feature that niakes Word simpler to work with is its definition of "Superscript" as a one- 
half (rather than one-third) reveree hne feed: all our raised hyphens wUl be of the height necessary 
Tcft^J- VF^' [o^e^-case letters. On the other hand, once the superscript function is 
activated in Word, all subsequent characters are printed one-half line higher than before untU the 
computer ,s told to return the printer to a "Normal" height. (Word's "Superscript" toggles on and 
WnHpJ.S'^^^'/"'^ "^T WordPerfect's "Advance Up" function; supeffcripring b 
WordPerfect works for only only character and turns ofc .utomatically.) h s 

The steps necessary for producing any vowel with a macron, then, are as follow^- 

1. Type vowel. 

2. Type AIt-8, usi^ng the 8 on the number pad, not the top row of typewriter keys. This 
produces a highlighted rectangle with a smudge in the middle of it, which is the symbol for 
"So B^-s?ki'' ) ^ ^^"^^^^ "^""^'"^ °" '•'^ "Soseki" will appear as 

3. Type a space, then press the left direction key to move the highlight one space to the left of 
the end mark. Othenyise, you'll be told in the next step that you can't edit the end mark 
'^sTpVsSip^Tress E^^^^^ '^^^ '° "Normal-Superscript-Subscript", space bar to 

5. Type hyphen. 
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6. Repeat step 4, but space bar to "Normal". Press Enter. (Note: if you are underlining, this 
step will turn off the underline function, and you will have to turn it on again. Better to ad^i 
underlining afterward ~ if at alP.) 

7. Press the right direction key, returning highlight to end mark. Backspace to remove extra 
space inserted in step 3. 

Fortunately, Word allows the user to save keystroke combinations so that this procedure need 
not be repeated each time. The instructions for "Creating a Glossary" explain how the above steps 
can be stored and recalled, each combination of vowel-with-macron being given its own name 
Assuming we give the name "o" to the combination "o", we can insert "o" into the text by typing 
^" ^° ^^"^^ ^ permanent file of macrons in the Glossary, be sure to type 
Y for savmg the Glossary before reconfirming that you want to Quit. 



Conclusion 

Having devoted far too much of my life akeady to the question of how to place funny marks 
over letters, I should emphasize in conclusion that as little of one's writing time as possible should 
be taken up with getting the procedure to work. This, of all things, should be a "set it and forget 
It matter. The system I have built around WordPerfect admirably fulfills this desideratum If 
anyone can work out the underlining problem with Word, it should run a close second 



•5 Dependmg on the characteristics of your printer, the printer driver supplied by Microsoft may 
cause the underlinmg beneath text containing macrons io be shifted several spaces to the right. 
This can happen because Word does not distinguish between characters and the printer codes for 
overstnke and superscript; it "teUs" the printer that the text to be underlined is somewhere other 
than where it really is. A Microsoft technician informs me that this can possibly bft corrected by 
modifying bytes 144 and 150 of the printer driver, but I have had uneven results with this 
procedure. 



erJc 
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Appendix: Additional Printer Codes for WordPerfect 

The following printers defined by WordPerfect use the same code for "Superscript on" as the 
Brother and would presumably respond to this modification in the same manner: 

• C. Itoh Starwriter and Printmaster 

• Diablo 630 ECS, 620 and 630 (Daisywriter) 

• NEC 3515, 5515, 7715, 3525 and 7725 

• Qume Sprint 5, 9, 1 1 and 1 1+ 

• Texas Instruments 855 

• Toshiba P1350 andPl351 

• Dataproducts DP series 



a zero 



The following are other printers with their very different codes for "Superscript on". Note that 
TO looks like 0 and capital o looks like O: iNuicuidi 



•Centronics 351: <27>L 

• Epson FX, MX-Graftrax and MX Type III: <27>S<0> 

• 5? 2686A LaserJet-B: <27>&a.35V<0><4> 

• yh^Jppnter and 5218: <27>A<2><27>2<27>] 

• IBM Quietwnter: <27>S<0> 

• NEC 2050 and 3550: <27>A<l>^27^7^-77^7 
. NEC 3510, 3530, 7710 and 7730: ^1>]^21>9 ^^^^ 

• Okidata Microline 93: <27>J 

• Toshiba P1340: <27>VP@B 

Some of these look more promising than others; the only printer I can be sure of is the Brother. 
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Indian Fonts on the Macintosh 



by George L. Hart 
University of California, Berkeley 

As my field is classical South Asian languages, and ?,s I work with Sanskrit and Tamil every 
day, I have long been mterested in ways to get the scripts for those languages (and for other South 
Asian languages) on paper m a legible form. This problem has been exacerbated by the fact that I 
must prepare vocabulary for students, and my handwriting is terrible -- students cannot read my 
English handwriting, much less my Tamil or Devanagari. Thus, when microcomputers started 
appeanng m large numbers about five years ago, I bought an Apple II and began to learn how to 
use It. 

After a great desJ of work, I was able to overcome one problem -- 1 wrote a program that 
produced the proper diacritical marks for South Asian languages -- but I still -A as unable to pui 
Tamil or Devanagan onto a pnnted page. 

Then, about a year and a half ago, the Macintosh appeared. It was immediately apparent that 
that machine was different m radical ways from its predecessors, and I set out to discover whether 
It would be able to produce Devanagari and Tamil more easily than other machines. I was 
fortunate enough to get m contact with a senior programmer at Apple, Mark Cutter, w! j wrote 
LisaDraw and MacDraw (which are shape-based drawing programs), and who himself was 
interested m implementing Devanagan. In fact, I discovered that Mark had already developed a 
Devanagan that runs on the Lisa, the predecessor of the Macintosh. While his implementation ^ad 
some rough edges -- which was inevitable, considering he did not know any Indian language - it 
was still remarkably good, and it showed that Devanagari could be made to work on the Lisa and 
on Its successor, the Macintosh. What made the system especially appealing was that Devanaqari 
did not appear only on tne pnnted page; it appeared on t\ic screen also. In fact, the screen and the 
pnnted page were identical. 

..ri^i^^^ ^ 5^®" ^"Saged in a project to develop and perfect South Asian 

scnpts on the Macintosh The rest of this article concerns what we have discovered, the progress 
we have made to date, and what we hope to do in the future. progress 

It is extremely difficult to produce Indian scripts in a satisfactory manner on most computers. 
The reason for this is that the shapes of the characters are stored in ROM and cannot be changed. 
Of course, most computers - the Apple II, the IBM PC, and most others - either have or can be 
given a graphics mode, and it is possible to write programs that produce whatever character one 
wishes on the screen m graphics mode when a key is pressed. The screen can subsequently be 
dumped to a pnmer, giving a pnnt-out of the characters one has written in ^ / 

There is one disastrous shortcoming to this procedure. A good word-processing program (or 
ouier sort of program m which one might wish to use non-Roman characters) takes many man- 
years ^0 produce. The graphics routines that are developed in the above procedures cannot be 
accessed through any standard word-processing programs. Rather, whoever uses them must write 
his own word-processor which, then, must be rudimentary, as he does not have the resources of a 
large software producer at his disposal - he cannot have several people working full-time on a 
processor for, say, Hindi. Thus, one is left with a rudimentary process that lets one put 
Devanagan or anoUier alphabet on the screen and to print it out, but which lacks the capabilities U^at 
make computers really useful: the abihty to manipulate text in a complex fashion, to use advanced 
data base techniques, etc. t- > ^ aurau^^cu 

The Macintosh differs from these computers in several ways, but in one respect that is crucial 
for those who wish to use non-Roman alphabets. Everything the computer does on the screen is 
"S^t'i ^"u"'^'^ microcomputers, when you press "a" on the keyboard, the machine 
looks m ROM somewhere, retneves the pattern corresponding to "a" and puts it at the cunent 
screen position On the Macintosh, when you press "a", the operating system looks to see what 
the cun-ent font (character set) is, what the current character size is (usually 9 12 18 or 24 
points), and what the current character attribute is (nonnal, bold, italic, shadowed, ou'tlined 
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Most Macintosh programs have the aoility to select the font you wish- in the maior won! 
prpcessing programs, it is possible to change fonts, sizes, etc. at any point CsTtTa-Te^; 
Sh'^^'/if ?'f ^^'^ '^^^^"^"^ ^^^^^ °f ^P^' ^ff^^^nt sizes onyTe 'anf even different 
fo2it;n<;^^f" ^l?"' n^ost complex functions (search, replace, justification, footnoSnrS 
formatting) of word processmg are available in all alphabets iw^noung, rancy 

The process of implementing Devanagari, Tamil, or any other Ir 'ian alphabet that i<; left tn 
right IS a relauvely simple one. It consists of defining a matrix mapSinfrbit manVfor p^^^^^^^ 
character, decidmg the w dth and placement of the character, deci^ng Sue nuiS^^^^^^^ 
for the font, and then putting Uiat information, all coded in th^ right foma\ c?Ued a '^e^^^^^^^^^^ 
the operating system (which is contained in a file called ^ystemT VSate v thS^^ VvTf 

EX'tn rn -^^^ "'^''^^ ''^^l ''^'^^"^P"^^ ^^^^ ^^ks ia-aralativeTy painSmnneV^ 0^^^ 
the mouse to fill m squares on the screen and so create the character and decide.: nnihi>" ^^fh !^^ 
placement of the character by manipulating little pointers xvithS mouse 

Mill, Indian alphabets have certain special problems, and it was imoo^^ant to see whefh^^r th^ 

With a few changes that made typing on the computer simpler. For DevaiiagS Thf^Sin??^^ 
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The next step was to actually implement the fonts, a process that was hampered by lack of a 
good font editor for th-? Macintosh (which made it necessary to do font development on the Lisa- 
there IS now an exceUent font editor for the Macintosh marketed by Altsys Corp. of Piano Texas)' 
1 began with Tamil, as its alphabet is much simpler than Devanagari, and then took the De'vanagari 
already done by Mark Cutter and modified it to conform to the IBM keyboard and to the needs of 
Sansknt and Hindi. Since then, development of these fonts has been continuing, both by me and 
by others. The foUowmg things can be reported: ^ ^ 

1. Apple has developed a new "keycaps" desk accessory, which allows one to have a 
picture of the keyboard for any font one is using on the screen in a window next to the 
document he is typing. This makes it easy to learn to type in an Indian alphabet, as the 
keyboard layout can be consulted as often as necessary. 

2. M.G. Srinivasan developed an improved Devanagari taking the work of Mark-Cutter and 
myself as a model He has called this Kanchi; I am now engaged in adding punctuation 
to this alphabet and modifying it to make it easier to type. When this is finished, Hindi 
should be very neail> as easy to type as English. 

3. Using a product callbd Thunderscan - which digitizes images from paper or photographs 
into the format of Macintosh O^acpaint) drawings - 1 was able to read in TanSl ch Jacters 
and use them to create a much more esthetic Tamil alphabet than my original. I hope to 
mon£^ distnbution (along with the new Devanagari) in two or three 

4. Professor Donald Becker of the University of Wisconsin, who has done extensive work 
m creaung soitware that pnnts out Devanagari, Telugu, and Arabic from the IBM PC 
onto a Toshiba pnnter (the screen being in Roman, the printing in the desired alphabet) 
has acveloped a Gurmukhi font for the Macintosh. He ha? just recently sent me a 
prehmmaiy version of Telugu for the Mac, along with a manual that describes how to use 

^' Script fo^'?h"fS^ ^'''"''^ ^''^'^'^ ^^^^l°P^d ^ Tibetan 

6. Two different BengaU scripts have been developed. One, by Dr. M. Z. Iqbal of Caltech 
IS being distnbuted on the public-domain font disk. • xqom on^aitecn, 

7. Professor Grant Olson of Cornell University has developed a font for Thai. 

There are several things that remain to be done. First, we need to get more Indian abhabets on 
the Macintosh. Equally important, we must make it possible to printfhe fonts on he Smriter 
Apple s new laser pnnter that produces output of publishable qEality. The difficulty with this is 
SfJiij^f '^"f u'^^" 5°^ bitmaps (as the Macintosh itself does) buUs shape 

(like Macdraw). It has software that allows it to create a bitmap of any size from a given shape 
thus insuring the best possible resolution for each character (the LaserWriter has 300 dots pe^ 
inch). When we have Indian fonts on the LaserWriter, it will be possible to compose a book on the 
Macintosh using Hindi, Tamil, English and other Indian alphabets in different s zes s^Ies and 

(or any specific program) can be made to come up and interact with the world in Hindi Tamil or 
Marathi instead of English. Programs on the Macintosh store the messages and mS^Sfth^^^^^^ Z 
specia resources that can be modified using a resource editor. This means thTt one ca^ take a 

entu-ely m, say, Tamil. All of the messages and text the program uses would then appear in that 
devS. " Tu'' ^^'^ "^""'^ M^'^i'^^sh and almost auTth^p^o^^^^^^^^^^ 

noSfnf t° Indian language that we have developed a font for iSe 

potential of this for such things as elementary school education is enormous 

It should be emphasized that even now, the same Macintosh and the same orogram can be med 
to pnnt a letter in Devanagari, Tamil, English, Gumiukhi, Bengali, Telug^o^ TiSS, or in any 
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comoination of those anguages; one does not even have to change disks. Ultimately all of the 
eft-to-nght Indian scripts should be available for the Macintosh. Indians from Sfferenfp^s of 
the coun^ who work in the same office could produce materials in all the major lanSages^e^ceD 
Uidu: different language versions of an official document or form could be piSSrintS on 
the saine machine; a Bengali and a Tamil could write letters in their respective hTguages^^^^^^^ 
whh"^pSlfn^H?f are running. One hopes that this univLality oflhSmosh 

with regard to different languages and scripts wiU contribute to the integration of the India 

To receive copies of the Devanagari and Tamil fonts for the Macimosh, send a check for $10 
made payable to the Regents of the University of CaUfomia to: 
Professor George Hart 



Department of South and Southeast Asian Studies 
1203 Dwinelle 
University of <^alifomia 
Berkeley, CA 94720 
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Examples of South Asian Fonts on the LaserWriter 

A Sanskrit verse from the Gita: 

A person whom all desires enter 

as the waters enter the ocean which, being filled. 

remains without motion - 

he attains peace, not the man 

who mns after desire. 

Two lines from the Tamil Pwanamini: 

LUiTgJliD aiELlSg lUiT61i^l5li) (SS(STfllt 

All lands home, all men kin; 

evil and good do not come from others. 

Some lines in Telugu: 

indftra WeSire''^'^^''^^^^^' ^^^^"^^^''^ ^'^5' was the golden age ofAndhra culture and 
A verse in Punjabi: 

ifS/ ' r^^^/ ^ ^'^^^ Of coal with soap and put it into milk and curd to make it white, but to no 
effect. Then I processed ,t with different dyes to color it but it did not change its color It had 
become black on separation from its beloved that would not go unless it is reunited Put it in fire 
and you will find it glowing with all its beauty. 
(Bhai Veer Singh) 
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Reviews of Books, Journals and Articles 



Note that addresses are included for some publications which come from smaller or lesser known 
publishers. The editor has wntten all the reviews in this section. 



Books 



Jn^?' Databases, Van Nostrand Reinhold, New York, 1985. aSBN 0-442- 

28185-4). This book presents a simple, non-technical overview of document databases. No 
attention is given to languages other than English and there are only a handful of references to 
papers ana other books. This book is of little or no use to the readers of this newsletter. 

Pocument Procping, Wiley & Sons, New York. This book gives a technical 
overview of the topic descnbed by the title. The chapters are- 

I. Introduction 
Z Input Scan 

3. Shape ManipulatiMi 

4. Digital Halfwning and Shading 

5. Data Compression 

6. Character Feature Extraction 

7. Document Preparation 

8. Document Retrieval 

9. Text Recognition 

10. Document Transmission 

II. Output Scan 

This is u rather technical book written by a sta.f scientist at Xerox Corooration Each char^^er 
includes a several page bibliography which makes this a good source boTfor Lsfneedine an 
overview. It was obviously intended to give only an overview of thftopics covered bu^ 
SSJS'anTsfon '^"^ ^" ^"^^^ OCR (OptiLrCha^cter 

Joan Knoerdel, A Survey of Standardization Efforts of Coded Character Sets for Text Processing 
National Bureau of Standards, Report Number NBS SP 500-81, 1981 SaS^of 

Tv"f ^^"^ US Government Printing Offi;; w^ 

20402) This technical report describes tlie standard character sets used in the United States The 

ch^LtTetf Itl^o^^^^^^^ i^"^?' fi'""'' ^"^'^ 'he details of it and oSieJ ^lated 
Character sets. It also descnbes the standards organizations that generate them, as well as new 

T:^lt'r^' ^'^^ Description Languages". These are general cTdes that^e useS to 

describe items such as page size, margins, headers and footers, etc. Sat in the futurrshould allow 
the-direct exchange of documents from any word processing s;stem to another! 

G. Sampath, An Introduction to Text Processing, River Valley Publishing P O Box 997S9 
Jeffersontown, KY, 40299, 1985. (ISBN 0-9615070-0-4) ke subS 
approach to the study of text structure and operations and the design of text processing sofS^ 
I IS based on a graduate level course in Text Processing taught by the ^author at Syracuse 
""""'""iSew' ""'"'''^^ of LouisviUe. The contents are (by siction and chapt^?): ^ ' 

1. The Text Processing System 
n. Texture 

2. Character Processing 
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3. String Processing 

4. Text Storage ancf Retrieval - Files and Buffers 

5. Text Patterns - Tokens, Words, Paragraphs and Segments 

in Form Patterns - Creation and Modification: Text Entry and Editing 

7. Display Forp • Extraction and Generation: Formatting and FormaU2d Editing 

IV Content ' Gsneradon: Lexic?l Ajialysis, Parsing and Syntactic Edidng 

V Interfkef ^'^''^'''^^ Analysis, Context, Semandcs and Usage 

10. Text Compression 

11. The User Interface 

Bibliographic notes are at the end of each chapter and a rather large bibliography is included as 
Appendix C. This is the only technical book that I have ever seen that deals with the basic issues 
of text processing (excluding more sophisticated topics such as computational linguistics) This is 
an excellent book and is highly recommended for anyone interested in designing and implementine 
text processing systems. ^ ^"ung 

T.C. Ting (editor), Chinese-Character Processing for Computerized Bibliographic Information 

nSBN^kll'^MlTi^r^^^^ ^5°°' 0"^^^' Canada KIG 3H9. 

(ISBN 0-889'^6-441-9, IDRC-239e). This is a summary report of an international workshop held 
m Hong Kong on December 17-20, 1984. There are five short articles and a 15 item bibliogmphy 
It IS pnmanly of interest to libranans, and developers of systems for librarians. & f J- 

The following technical reports on T^X and METAFONT (see the previous issue for more on 
these two programs) are avaUable from the Stanford Computer Science Department. They can be 
Q^^nl /f""?- Computer Science Department, Stanford University, Stanford CA 

94305; telephone: (415) 497-4776. California residents should add 6.5% sales tox 

Report Author Htle q^^^ 

ri'mi v^^L pi,cc l^^^^' slanguage for Chinese character design" m 
Plass Breaking paragraphs into Unes" ^ H 

rl'svn P^l ..On the problem pT inputting Chinese characters" m 



r^c O/io -r hX o yaxa^iafjiio nivj unci) 

ri's7n P^l ..On the problem pT inputting Chinese characters" m 

ci-IS Sdi ..gSp^gg™^^^^^ M 

CS-901 Fuchs,Knuth "Optimal font cxhing" i'^^ 

rl'lm ;;Using string matching to compress Chinese characters" 250 

rfl^ [Interacting with grapffic objects" 495 

Qfifi RkP^^iL rhoch "■^f.W^'"' ^ ^ '^'g" ^J^^ composidon in Indian scripts" 645 

CS-974 BrSb?'°^'"^&S4Sn?''''^^''^ II 

rfVil ;;WordHy-phen-a-tionbvcom-Dut-er" 4 70 

r^iln ^"5 LKsonsIeanjedfromNffiTAFONT- to5 

r^ls? TheVVEBsystemofstnic'ureddocumentadon" glO 

JH^"'". Literate Programming" S'i^ 

C5;-q85 fJnmiipI "C.-ncf nroj«*rcv" ^ 2.45 



CS-985 Samuel "First Grade TEX" ,q<- 

^i'lmT D^ann^'^en "How to Run TEX in French" ^'i^ 
CS-1027 Knuth "A Torture Test for TEX' fl^ 

If the cost is described above as "M", then the report is available in microfiche. All other reports 
are avadable on both paper and microfiche. Each report in microfiche costs $2.00 Man^of thes^ 
reports do not deal with languages other than English, but are listed here because they may still be 
of some interest. I have not read any of these reports yet. ^ 

The Proceedings of the 1985 International Conference on Chinese Computing, published by the 
Chinese Language Computer Society, is now available for $40 (US and Canada) or$45 in the rfs? 
D^^'^f "^^'Jo- J ""'^^"^^ C.N. Liu, CLCS Treasurer, 18-1 14 IBM ReseSch Center 
Box 218, Yorktown Heights, NY 10598. I have not seen it yet. 
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Journals 

f Jin JJf^f J21/year (US), $23/year (Canada, Mexico) with higher rates for other countries is 
available from: Byte Subscriptions PO Box 597, Martinsville, NJ 08836. WillSm Ra°ke wSe^^ 
Byte Japan U regular column in Byte magazine, one of the more well known miJrocom^^^^^^ 
magazines. He descnbes many of the new Japanese computers, some (bur^S)^S S wHI 

T^ejoumaJXomputerProcessing of Chinese and Oriental Langi^ is published by the Chinese 
L^guage Computer Society, which was described in the previous issue of thrnewslette? cS 

Fll^^S^iSoVM^^^ ^-".-Jn"'!^^^ MernbershTp cl"S Gera 

Volume 1, issue 1 (July 1983): 




f V'Kg'ci"^^^^^ ' languagefoFChfe^e^cblprocessing", pp. 40-58. 

m^f',m 59-70^ ' ^ ^^""^ text-to-speech synthesis system based on an initial-final 

^-^'Si^ "TS? 7 i-T7 ' °^ anti-aliasing technique for displaying high quality Chinese 

X. Wang, "Systim design of a computerized and laser typesetdng system forChinese characters", pp. 78-85. 

Volume 1, issue 2 (December 1983): 



{•£ Tsay and I.C. Wu. Tomputer-^ siste^^^^^^^^ 

J.M. Unger. A mnemonic code for Sino-Jrpanlse chLcters (K^^bTed ent^'^?U!^-^^!l^«\ pp. 1 35. 

^ ^chir"£t^^^iT5%'!^'^'^ ^'^^y 'PP^^ '° of handprinted Chinese 

VOiUme 1, issue 3 (May 1984): 

^rflSr ' "'^'^^ "^^ °f Kanji fonts and inhouse printing/publishing system", 

^-^im^,fp\W6^°^^^^ ^^'^^^ °f ^^^^ composiuons of frequendy used Chinese 

T ill "F nnoiiijoo onr» Irinaf-:«-^ r^; » 




..^^ »a ^.n. KwoK, -A database implementadonof an inteScSe^i'tekphon^^ directory", pp. 195- 
^■^■i?ste?n>^^S^^il'^^''^ computerized abacus for office auton.auon and computer-aided 
Volume 1, issue 4 (November 1984): 

^■^■21?224: "E'^Periment on Hnyin-Hanzi conversion Chin- -ord processor", pp. 

C £ Chin an"d R w ?fc "7^°-}^\?^ ^"S^'}8 ^r Chinese input systems", pp. 225-235 
H R nJ^^^rv&l' ..^^^'"^^°u'? °^ ^^'"?se input methods", pp. 236-247. 

H:w.HS"g'an%Y.X'^g^ 

275. ^-i-nwang. Microcomputer CAI for Chmes . lage with character generator", pp. 268- 

I.M. Liu, "Recognidon of fragment-deleted characters and words , p. 276-2^7. 
Volume 2, issue 1 (May 1985): 

C.H. Chen and Y.H. Pao, "Computer recognidon of Chinese language (Mandarin) homonyms", pp. 1-22. 
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"^^'^^^'^ of Mandarin by means of Chinese phonemes and phoneme-pairs (TIFH)", 
^'^'H-si" ^'^'^"''^'^ comparing various methods of inputting Chinese characters into computer", pp. 

^' ^hfonnaSeJc^^ng" 'pp ^"g'^^^^^'^^^tion of the structural elements of Chinese charxters in 

Occasionally there are interesting articles in Infoworld, a popular computer ne\ 's weekly It was 
tormeriy published m a magazine forma', but has recently returned to its initial tabloid form 
Several articles that are of interest are: tauium .orm. 

Alexander Besher, "Hong KonA microcomputer industry", 12 March 1984, pp. 79-81. 

David Rothman, "Saudi Arabia^mbrxes Micros", 23 April 1984, pp. 25-26 

Peggy Wat^ Legends Inscnbedon rloppies", 21 May 1984, pp. iS-26. 

°" ^^'^ a ^^'^ ' 28 May 1984, pp. dd-68. 

Juditii Walthers, Translators GainHuency", 13 August «84, pp. 35-37. 

Mark Fniin, T/Maker makes inroads into the Japanese market"rVoI 5(49), pg. 75. 

Language Monthly is a periodical that seems aimed primarily at translators and language teachers 
They feature articles on translatmg, machine translation, trends in schools and universities as wdi 
as reviews of dictionanes and periodicals. It is available for £17 per year in England and $28 ner 
Ng'i 5"^, Englanf '^^"^'^ totPraetorious Limited, 5 East Circus sSt, Nottingham 

Research in Word Processing is a new newsletter. It is published to be a "clearinghouse of infor - 
^^SuV-T"^^ ^° '^^"'P.^^^-based writing instruction at all educational levels - from elementary 
u-uv . "^"^ ^^^"^^ will feature original research, article abstracts 

bibliographies and reviews. It is available for $12 per year in the US fro» : The feStors Research 
?7?Xi1q^qT'?V"^ Newsletter South Dakota School of Mines and Technology, ^ Qty SD 
57701-399:>. I have seen only one issue so far. It looks very good, especially for DeoDk 
interested m usmg computers in education. =>pci.icuiy ror people 

^^^S pSti^^^ Translation, a newsletter mentioned in the previous issue of this newsletter, has 

Visible Language is a quar rly that deals with the visual aspects of language expression It h 
available for $15 per year (for individuals) from: Visible Language, Box 197rc/o T^veland 
Museum of Art, Cleveland, OH 44106. There has been a specid issue edited by J Marshall 



Articles 



n^S ^Q^< "J"^aica °"^ation in Israel - An Overview", Judaical Librarianship 1(1), pp. 9- 
II (Fall 1983). This article descnbes all the current "Judaica on computer" projects in Israel 
Bnef descnptions of the various systems are given. pi^jci^ts, in israei. 

Joseph Becker, "Typ Chinese, Japanese and Korean", Computer, January 1985 pp 27-34 
This article discusses u nature of the difficulties in typing in the ideographic languages and how it 
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Kathleen Burton, "Over There: IBM In Japan", PC Magazine TmIv IQR^ nn 97^ osn a 
technical article on IBM in Japan, especially on W BK5Tjap;ne'e ?C ' ^ 

rHp'^nriS' ST''"^'^^^^^''' Asian Languages", Library Software Review 3 
rrWnu- T ^^^^ article reviews the Research Library Group's and OCLC / Asiaeraohics's 

Swi^fr Tl,"/'P'""f ■ 'y''^"^'- "There is also a small bibUography S sever^^^^ 

articles from trade journals on CJK systems and related topics. 

Jere Fleck, "Multi-language Use: A Survey of the Problem", Capital PC Monitor 4(7) nn ?8 49 
?Jnit^Pr T?'''"r " °^ multi-lingual users on IBM Pcl R wS publ k^ed^ 

David GUner, Ellen ::ovacic and Hebert Zaften, "Hebrew Calaloein. via Miorocomnuter" <!^n 
S^rit^f ";k""3Tf ?P- <S'P' 1983). This article discribes °eSeloied"o! 

developed for [he library at th. Heb-;.';; &oVe8S'n'ed?n'Sovt^^^^^^^ ^e^fs"^ 

94°rju*ne "S S"- ^"'^ " Double-Headed Approach", Dr. Dobb's Journal issue 
flS^. If- J'"' describes an implementation of the Forth computer laneuaM i! 

thtn^f- ^ " ■'^"u',"''i V"' °f '='='>"'=al nucricomputer mafiaSoubUshed 

d^t^fa^tSSSM^--'-^^^^ 

Masasuke Merita, "Japanese Text Input System", Computer 18(5), pp. 29 35 (May 1985^ Thi. 
article descnbes a new approach to typing Japanese into a computer^ Kew apS^^ 
nnhi-r^T^f ''ri.^^^S^"^'"'^ 'he saSie principles as the Dvorak keybolrT coSr^^^^ 
enlSsiSVe US^^^ "^^^P"'" ' '''''' '^e primary assoSfon fo ^Sr^nlc 

Irr^nWn^!' 1 Sfi? ^^1!^ f ^ Computer Printers", South Asian Library Notes and Queries issue 
18 (Spnng, 1985). This bnef article lists several sources of Indie fonts primarily f^H^f a nni. 
Macintosh, but some for other machines as well. SALNAQ is pubS bTthe Co^^ 

^ ^0^37'°"^^'^°" ■ '''' ^rJ^^^ Ub^: WSTt^ 
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Recently, it seems as though every computer magazine has reviewed most or all of the keyboard 
macro programs such as ProKey. Tliese programs are of interest to all readers who use IBM 
PCs or compatibles, as Tony Stewart's article in the previous issue of this newsletter 
demonstrated. Three such articles that I've run across recently are: 

1 ,^oP^^^^? Obregon, "Power Plays At Your Keyboard", PC Magazine 4(22), pp. 167-175 (Oct 
29, 1985) Reviews ProKey, SuperKey, Keyworks, RE/Call, Newkey and Smartkey. 
inocx S*^" Walkenbach, "Keyboard Shortcuts", PC Tech Journal 3(10), pp. 131-144 (Oct 
1985). Reviews ProKey, SuperKe> Keyworks, P^Call, Smartkey and Keyswap 

Dan Swearingen, In the Right Key", PC Products 2(10), pp. 33-49 (October 1985). 
Reviews ProKey, SuperKey, Keyworks, RE/Call, SmartKey and Keyswap. 

Now that Apple's LaserWriter and other laser printers are out, there is more interest in typography 
These pnnters are dramaUcally choiging our views of what good quality output looks like He^e 
are four articles on this topic. Note that even -hough some of them are now several years old thev 
are still quite relevant ^ ' ^ 

1 S-^^^^ ■'^•^Selow/'Font Design for Personal Workstations", Byte 10(1), pp. 255-270 (Jan 
1985). Discusses the difficulties of designing fonts for computer workstations 

K.cvc 95^1" "Digital Typography", Scientific American (Aug 1983). Discusses the 

basics of digital typography and how fonts have evolved from older methods of setting type 

Jonathan Seybold, "Digitized Type: What is it? What does it mean for typesetting and word 
processing?" The Seybold Report 8(24), pp. 3-17 (27 Aug 1979). Gives an exceUent 
presentauon of what digital typography is all about 

T ^ Bigelow and Jonathan Seybold, "Aesthetics vs. Technology: Does Digital 

Typesetung Mean Degraded Type Design?", The Seybold Report 10(24), pp. 3-16 (24 Aug 1981) 
Giv^ an explanauon of what a typographic designer looks for when desiring a typeface, as we U 
as a bnef history of typesetting. ^ ^'H^iai-*^, as wcu 

The Seybold Report was described in the previous issue of this newsletter. 

The following ^nicies are all the articles found in all issues to date of the journal Computer Science 
and Informatics that deal with Indian scripts on computer. This journal is published by the 
Computer Society of India, descnbed in the previous issue of this newsletter. It was formerly 
kiiown^the Journal of the Computer Society oflndia. This is a difficult journal to locateTthe 
United States. For those hving in the San Francisco Bay area, the only library th^t has a complete 
' /SnH^f/nH^*? p Mfhcmatics Library on the Stanford University campus. The articles 

b^&cffi" °f P^R^ P«^^ that produce output o'n (falcomp ploS hS rIeTof ou^uftf 

P. NarasiiTJiam3. ftasada and V. Rajaran.an, "Code Based Keyboard for Indian Languages". JCSI 2(2) dd 33- 
c M 37. Qivesdescnpuon of current and proposed keyboards fo? Indian languages ^ ^' 

S.N.S.Raj.asekaranandB.L.DeekshatuIu,"6enera^^^ JCSI 4m nn 3 fi 

r.^'yJ^ proposals to generate and recognise Telugu charactls. ^ ^-naracters , j lcji 4(zj, pp. j-e. 

lT2&"fif rv^^^Soi?n?".'' "Computer Recognitipn of Telugu Vowel Sounds", JCSI 7(1), pp. 

d^vSjidoLSyCll&^T^^^^^^ recogn-seTelugu vowel sounds. The system wal 

^•''|i97tim iS'S ^^'"8" Characters", JCSI 7(1), pp. 42-43 

'■nQ7^jf nff ^- ^' . f" ^lekshatulu, "Generation of Artificial Character Database", JCSI 8(2), pp. 49-54 
^ I.- ^1?I^LP,^5'?Pm ^l?^u"'E'^°^..9L8^"^'^""8 test data for OCR work. ^ ^' ^ ^ 

nlo^V^r^^ -u^- ^^f'^^ "Character-ROM Based Display for Indian Languages", JCSI 1 1(1). nn 9-i4 
p c (?981) Descnbes a hardware system for generadng Indian scnpts. ^ 

"2(n nn nln ?7 no«W- ^^"^ Languages: Design Information Reports for Four Scripts"', CS&I 
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A specml issue of the Journal of the Institution of Electronics and Telecommunication Engineers 
devoted to Computer Applications in Processing of Indian Languages and Scripts" has come out 
Th^ special issue was edited by Dr. R.M.K. Sinha. Copies of this issue may be ordere^fZ: 
ine iiduor, The Institution of Electronics and Telecommunication Engineers, 2 Institutional Area 
Lodi Road, New Delhi 1 10 003, India. The tities of all articles and letters are as follows: 



Artides: 



RM.K. Sinha, "Compute Processing of Indian Languages and Scripts - Potentialities and Problems" dd 133-140 
^•^D^vSlShJ^cSk^:^?^^^ TompSterRecognidonof TSSykSf^'^^ 

tBmSA%''^nfJ'vr?}^^^^^^ ^°''R?5£?P'^P" and Analysis of Majorlndian Scripts", pp. 155-161. 
l2rfiS>p.^?167: ' ^""^"^ Producaon in Hindi and Thai via a Software iVian-Machine 

^' ^imptelffi pp'"i68^^^ of Devanagari Composite Character Pattems for Data 

PL, Erniliani, P. Gmi^i and A.Tronconi, "Speech Synthesis for Italian Language and its Applicadons", pp. 173- 
BehtwzParhami, "Standard Farsi Informadon Interchange Code Keyboard Layout: A Unified Proposal", pp. 179. 
9^^^\ A.K. Datta, "On Machine Recognidon of Telugu Speech Sounds" dd 184-189 

PJCGhosh, "gasic Design Issues in a Muld-Lingual Type Font Design andTypesetdng WoAstadon", pp. 196- 
^■^Scrip^te^^pS^ S^t^aramaiah. "Alphanumeric CRTTerminal for Telugu andHindi 

A. Nageswai^ ftao, "New indo-Roman Script for Transliteration of aU the Indian Languages", pp. 215-222. 
Letters to the Editor: 

T.R. Rammohan and B Jsf. Chatterji, "Recognition of Distorted Kannada Characters" dd 223-22'> 

'•iS^-iS-S^'^^f-^ Variable Width Composite Consonant Text in Character 

^^•(?iSSt»o^^^^^^^^^^ 131-^^ Cues for English Stop 

R m¥ ?nh.^^'„!?n"l^'?'"2 "'^!?>u^ Prerequisite for Word Processing", pp. 238-242. 

m 243-m '^'^^'"^ Transhteradon from Roman to Devanagari and Devanagori to Roman", 

R m"k ^;nh^p!?)i'"l"c°i^f 'S^a ^Sf Devanagari Script and Numerals", pp. 246-248. 

R.M^.^Smha andK.S. Singli, A Program for the Coirecdon of Single Spelling Errors in Hindi Words", pp. 249- 
°''^Ss?-mifffi^ "Representing Ahamia, BengaU and Manipuri Text in Une Printer and 

Zhang Liansheng, "A Dcsigii for the Computerized Processing of Tibetan Scripts", pp. 257-260. 

This is an excellent collection of articles on Indian languages on computer. While there is some 
hngLron^^mp^tef ' '^^'^^ recommenced to an/one working wJtVlnS 
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Product Listings 



In this section, we will report on hardware and software products that are of some interest to 
the readers of this newsletter. Note that none of these items is being reviewed All information 
has been supplied by the vendor. If someone has used any products here and writes a review we 
would be delighted to pnnt it. If you are a vendor of hardware or software that would be of 
interest to the readers of this newsletter, you are also invited to submit an article for publication. 

Hardware Products 

Xei ox Corporation: 6085 and 8010 Star Workstations 

Description: Coniputer systems with specialized software allowing word processing in twenty 
languages, including Chinese, Japanese and Korean. o f & y 

These workstations are the ones that stsrted the revolution in user interfaces popularized by the 
ro^A? B^^'^^ ^^iTt '^o'^PUters, and since copied by the new user interface programs for the 
KofS Wind?wT' ""'^'^^ ^"'""'''^ Environment Mana^ (GEM) and 

The development of the user interface continues unabatedly at Xerox and the Star workstation 
family now supports word processing and other functions in 20 languages, including English 
French, Gemian, Russian, Chinese, Japanese and Korean. Instead of having twenty differenl 
keyboards, tiie system requires only one, witii a "virtual keyboard" displayed on the screen in the 
I'^'^'^u-^f^^l ^-^^ ""'"^^^ of languages can be used in a document^ sLe technicafdet^U^^^^^^ 
how this difficult task was accomplished have been detailed in several papers by Joseph Becker 
(see the Reviews section of this and tiie previous issue of this newsletter) 

The workstations support intermixed text and graphics, tables and forms. Each workstation 
has a large ( 024 pixels horizontally by 808 pixels%ertically), high lesolution S^Iay S 
per inch horizontal y and vertically). The keyboard has a standard typewriter layout and 24 
additional function keys. There is also, of course, a mouse. There areTeveral confutations of 
the machine with varying amounts of hard disk storage and networking capabilities A set of 
workstations can be connected together by an Ethernet LAN (local area network) 

Japanese can be entered using either roman characters (romanji) or kana. If characters are 
entered m roman characters, they are automatically converted to kana. When a word is comoStS 
die system tnes to locate all Kanji characters tiiat have the same spoken sound arthat ust tyS 
The user is tiien presented with the choices in order of highest frequency of use 
n..^K '""^ IS entered by typing in the sounds of the word in either pinyin or bopomofo. Tone 

r }t ^""^T^ '° '"^^^ ^P'"S P'°^^^^- The system then looks up all possible 
Chinese words tii«t have that same sound and the choices are again presented to the user ^ 

CA%nS::t:^^^^^^^ '''''''' '''' ^-^^^R-^' Alto, 

CFT: Phoenix 8525 Word Processor 
Description: Stand-alone word processing system for English and Arabic. 

rjyPUoK ^ "lf^°' ^'^"'^-^^oilf word processing vendor. They have an Arabic package for their 
CPT 8525 word processor. This package allows the intern Jxing of English and Arabic in tJie same 
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document The software automatically selects the appropriate form of each character as it is typed 
Documents can be pnnted on eith 2r their standard dot matrix printer or daisy-wheel printer 

For more mformation, contact CPT Corporation, 8100 Mitchell Road, PO Box 295 
Mmneapolis,MN 55440. Telephone: (612) 937-8000. ' 

Eastern Computers Iha: Graphics Boards 

Description: Apple II and IBM PC compatible boards for word processing and working with 
Chmese (other languages to follow). 5 a u lunj, wuu 

Eastern Computers, Inc. has developed series of boards that allow an Apple H or IBM PC (or 
7n nRn^^P '^''vf^^^ ^""^ in various languages. The first board can display more than 
30 000 Chmese character m a 16 by 16 dot matrix. The boards will allow programming in 
Chmese as well as word processmg. Additional boards for other languages! as well as 
improvements (mcludmg a 24 by 24 dot matrix), are under development 

For additional information, contact: Eastern Computers, Inc., 600 Lvnnhaven Parkwav 
Virgima Beach, VA 23452; telephone (804) 340-2496. ^ynnnaven i^arKway, 

Fujitsu: My OASYS Word Processor 
Description: Stand-alone word processing system for Japanese and English. 

Fujitsu is now selling their My OASYS Japanese word processor in the United States The 
machine displays Japanese characters m a 24 by 24 pixel format A basic set of 3418 kanii k 
stored m the machine (the basic JIS first level set of kanji) and it can sVoX aSiSoS 3384 use 
defined kanji. Text can be entered in either kana or romanji. Conversion to kaTirperfor^ed 
through user interaction, The system can also work in English. Basic gS can alsHe 
generated usmg the machine. A dot matrix printer is provided to^r output 

For more information, contact: Catherine Kawakami, Fujitsu America Inc 3055 Orrhard 
Drive, San Jose, CA 95134; telephone: (408) 946-8777. ^"^enca, inc., 5^55 Orchard 

Maracom: The Font Machine 
fonfs"fof printe^'^'^^^^^^^^'^^ ^^^^"^ ^^^^ °" ^" ^'^ microcomputer for developing 

Maracom Corporation has developed a font development system based on the IBM PC/AT 
This system can be used to develop fonts for a wide range of output devices including dot matrix 
printers, laser printers and other types of "dot" matrix printers. Fonts can be scanned in uS a 
video camera, fhe characters are then edited on a video monitor. Dots can be moved addid 
removed and portions of characters can be moved "lovea, aaaea, 

con^^'t te~ott&'eS^"'''' ™""f^""-« is available only on a 
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MuIUtech: DCS570 Workstation 



Description: Microcomputer system for word processing and programming in English and 
§!icu'r?entDOS^" ^'^""^ ^"'^ ^ ^'^ital Research's 

Multitech has developed a bilingual workstation that supports both the Chinese and the i^^nslish 
languages A hardware character generator holds 17,000 Chinese characters and displays them in 
nnc^ ?• operating system is a Chinese version of Digital Research's Concurrent 

DUi>, which allows multi-tasking and windows. 

Additional software avdiable includes a Chinese version of the T/Maker database package. 
The system IS based on 8088 CPU and includes 512Kbytes of RAM, two floppy drives, a serial 
Mi ^nl? keyboard and a monitor and has room for 6 IBM ?C compatible slots for 

additional cards. The base pnce IS $6250. Several options are also available 

For additional information, contact: Michael Tsai at Multitech Electronics Inc 195 W El 
f/f ?T^J Sunnyvale, C A 94086, telephone (408) 773-8400 in the US; or William Lu at 
(02 5MU01 ^^'^^ "^^'P^' "r^^^^"' R-O-C, phone 



Research .puter Technology Corp.: Alraed Computer Series 

Description: CP/M and MS-DOS compatible microcomputers for word processing and 
programming m Arabic and English. Applications software for both languages is available. 

Research Computer Technology Corporation has developed a series of three CP/M compatible 
h?7^ 80 r?r iV^ programming and use in Arabic and English. All three mode^are baS on 
rmT A '^h'P-./^J^pre model, the Alraed 600, has an Intel 8086 and an Intel 80286 
CPU A letter quality daisy-wheel pnnter is available. A wide variety of software is available 
including many programming languages, such as Basic, C and COBOL. Applications softu'are is 
available for both the Arabic and the English languages. 

For more information, contact: Research Computer Technology Corooration 3 ' 52 Kn<:hiwa 
Street, Torrance, CA 90505; telephone (213) 534-5800. ^corporation, j .52 Kashiwa 



Software Products and Services 



BKA: EISA Database 

S^ff^o?'°"*A^"u.°r'""ru'^^ -^^^^^ ^^'^'^'^ available worid-wide that covers Southeast Asian 
matenals in Ausirahan libraries. rvamn 

The BISA (Bibliographic Information on Southeast Asia) project was started in the 1970's to 
provide more access to the collection of Southeast Asian material in Australian libraries A 
comprehensive eler- database is available for use from virtually anywhere in the worid 
Indonesia is the coui ^ith the most coverage in the database, with Malaysia and Singapore 
running second. The database can be searched by author, title, sub ect or any individual keyword 
ana the results can be restncted by language, publisher, date of publication, etc.. The database 
stores information m any language. ^n^uaidudic 

BISA also publishes books and monographs on material from the catalog. They also run a 
traineeship program for hbranans on database creation and information retrieval 



Newsletter for Asian and Middle Eastern Languages on Computer, Vol 1(2) 



Page 23 



more information, contact: Helen Jarvis, Director EISA, Univer<;itv of <;vHnpv M^v 
2006, Australia; telephone: 692-2222 or 692-3164 (direct line). ^"'"'^'^^'^ °^ ^y^^'^V' ^SV 

Apanda: Chinese Word Processing System 
Description: Apple H based word processing system for Chinese. 

m^ir^^ developed a Chinese word processing system based on the Apple II computer An 
BM PC version of the softwa.-e is under development as well as Japanese Sana and KaScana 
and Korean characters set. Chai^acters are displayed in a 24 by 24 malrix bSicer sizeS 
48 IS dso supported making higher quality output possible by phot^educ iof Th' e are 6^^ 
smiplified Chmese characters provided along with 7700 classic^ characters 

meth^dTwrnVeS^^^^^^^ '''''''^ ^° Additional character entry 

770S; te'^p'Jn"^^^^^ ^"^"'^'^^ ^^^^ Suite 249, Houston, TX 

Altsys Corporation: Fontastic Font Editor 
Description: Font editor for Macintosh computen. 

Altsys Corporation has developed a font editor for the Apple Macintosh computer This editor 
allows a user to modify an existing font or create a new font The fonts ^mi tKe installed hv S 
user. These fonts are then usable as any other Macintosh font, i.eX^Tan be uL?^ Inv wo^d 
processor or applicuiions program that supports the use of fo-. s The font p^L?!-.. ^ ■ 
of fonts to and from MacPaint files. In thMaTest Se fve^^^^^^^^ f"""^' '°PJ^"S 

operation is available. This means that portiS o^^^^ 

^i^i^^i^^:^^ ^^^^ ^^^^^^^^^ 

and^tS^Thrn^^^^^^^ 

appear to have any bugs) and there e ., more fLiSffor eid 'g font " ' ""'^ 

Altsys has announced a new product which will be available in December thi<: vpar ^ f^n. 
designer that will create LaserWriter fonts. This is a very exciting pSt th^t yfu^l,;'^^^^^^ 
creation of fonts, logos and so on that can potentially look profession^ Mac^^^ fon ? tt 
^nf^J^'l^^'T f "^i"^ ^^"g'h' LaserW? r fonVare described by a^s^^^^^^^^^ 

thai ttTo^^dSl^^^^^^^^^ ^"^"^^^ (assUinTo& 

telepSM)?9T^^^^^^^^^^ ^'^^^ Corporation, PO Box 865410, Piano, TX 75086; 

iiiSiagraphics: The Asiagraphics System 
Description: Word processing software for Chinese and Japanese for JIP-&6 microcon ,uters . 
Asiagraphics has developed a multi-lingual system based on the Hewlett-Packard HP P(= 

same sound. 1.. . then selects the coixect character. Four levels of operation are available! 
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allowing more experienced users to work at a faster rate. New characters can be created by means 
of an interactive character design program. 

This system can be used with the OCLC on-line library database system. 
J^"" ^^^^}^°^^^ information, contact: Asiagraphics, 141 Mt. Sinai Avenue, Mt. Sinai NY 
11766; telephone: (516)246-8365. 



CEEDE: Instructional Materials 

Description: Instructional software packages for the Apple II microcomputer in Vietnamese 
rimong and Lao. ' 

The Center for Educational Experimentation, De /elopment and Evaluation at the University of 
Iowa offers mstructional materials in Vietnamese, Lao and Hmong (as well as many other 
programs and projects). All programs run on Apple 11+ or He computers. The programs ar- for 
language msLraction and cultural study programs. 

Many other unrelated instructional software packages and training programs are also avaUable 

(319)°35T4200 ^ ^"^^ ^^^"^^ telephone: 



DataTnuisfomis: Fontrix 
Description: Apple H and IBM PC software package to develop and print fonts. 

Data Transforms offers a graphics software package for the Apple II and the IBM PC that 
allows a much larger variety of fonts and graphics forms than usual. The program can be used to 
design forms, tables and even maps. Any text file can be read in by the program, which may nose 
some mterestmg problems m integrating another word processor with this program. It works with 
a wide vanety of dot-matnx pnnters. A font editor is included as part of the package 

They also sell sets of fonts. Among the fonts available are: Arabic, Hebrew Jaoanese 
(Hu-agana and Katagana) and Sanskrit (including all the vowels but only consonants 'in the form 
consonant + short a). 

A review of tl'.is product appeared in the 21 January 1985 issue of Moworid, on pages 41-43 
telephonT(1o3)S?oT ^^'^ Transforms, 616 Washington Street, Denver, CO 80203; 



Gulf Data: Arabriteand Arabdos 

Description: Word processing and programming packages for IBM PC compatible computers. 

Gulf Data oLers two programs: the Arabrite word processing program and Arabdos 
programming tool. Both operate on uie IBM PC and compatibles inu Ara dqos 

th. processing program allows the entry of both Arabic and English, even on 

die same line The program automatically determines the coixect form of each Arabic character 
llie commands, menus, help screens and manual are in both Arabic and English. It can print using 

Se provWeT ^""'^'^ °' ^'"^^^ °' '^"^ "'^'^^ '^"^^''^ P""'^'^' ^^'"^^^^ 'he keyboard 
«r oTf^^ ^bdos programming tool aUows programmers to develop applications in Arabic, English 

Basic, COBOL, C, Fortran and dBase II programming languages. Examples of progrj.Tis in 
several of these languages are provided. Applications developed c.n: read in English or Arabic 
text, perform the insertion of Arabic text into EngUsh and vice versa, display Arabic and/or English 
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prompts, update Arabic or English data files and output Arabic or English files to dot matrix or 
cSei'^ ^^^""^ downloadable character sets. aUowing variS in 

If a display is needed for displaying Arabic, an IBM PC compatible board is available 
9l3uTteTrhontS;^^^^ '''' ^"^^-'^^ ^hatsworth. CA 

ICL: Hanyupinyin Word Processor 
Description: Word processing software package for Chinese on IBM PCs and compatibles. . 

matnx Text can be entered m pinyin or Mandarin along with the vowel tones ^ ^ ^ 

tor more mformaion, contact: International Computerized Linguistics Inc '^RR'^ <Jn„fh 
Gessner, Houston, TX 77036; telephone: (713) 266-2179. linguistics, inc., 58)55 South 

Lexisoft: Spellbinder/Arabic 
S^StSs^g Kcti ^I'Z' ""^ °" ''^ ~mpu.ers for 

(916^573130.'"''''"''^°"' ^^'^ ^378, Davis, CA 95617; telephone: 

Proper Software: The Font Librarian 
Description: Font moving program for the Apple Macintosh 

Font, can be examined m several ways: as a string of characters as an A^riT J , u I 
marked with the key caps that produce each key, orihe use^ca^^^e L^^^^^ ^^^J^^art 

h.„ Str" 'r*'^^ "shareware" product, that is, anyone can copy it or eive it to Men* 
a„"d"/SSf?AhTp^n;^;°" ^'""^ ^ In excLge7rreciL*a'3S 

CAS7SS^Mi)1»'"' •«24, Berkeley. 
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